Antisense modulation of NOD1 expression

ABSTRACT

Antisense compounds, compositions and methods are provided for modulating the expression of NOD1. The compositions comprise antisense compounds, particularly antisense oligonucleotides, targeted to nucleic acids encoding NOD1. Methods of using these compounds for modulation of NOD1 expression and for treatment of diseases associated with expression of NOD1 are provided.

FIELD OF THE INVENTION

[0001] The present invention provides compositions and methods for modulating the expression of NOD1. In particular, this invention relates to compounds, particularly oligonucleotides, specifically hybridizable with nucleic acids encoding NOD1. Such compounds have been shown to modulate the expression of NOD1.

BACKGROUND OF THE INVENTION

[0002] Apoptosis, or programmed cell death, is a naturally occurring process that has been strongly conserved during evolution to prevent uncontrolled cell proliferation. This form of cell suicide plays a crucial role in ensuring the development and maintenance of multicellular organisms by eliminating superfluous or unwanted cells. However, if this process becomes overstimulated, cell loss and degenerative disorders including neurological disorders such as Alzheimers, Parkinsons, ALS, retinitis pigmentosa and blood cell disorders can result. Stimuli which can trigger apoptosis include growth factors such as tumor necrosis factor (TNF), Fas and transforming growth factor beta (TGFβ), neurotransmitters, growth factor withdrawal, loss of extracellular matrix attachment and extreme fluctuations in intracellular calcium levels (Afford and Randhawa, Mol. Pathol., 2000, 53, 55-63).

[0003] Alternatively, insufficient apoptosis, triggered by growth factors, extracellular matrix changes, CD40 ligand, viral gene products neutral amino acids, zinc, estrogen and androgens, can contribute to the development of cancer, autoimmune disorders and viral infections (Afford and Randhawa, Mol. Pathol., 2000, 53, 55-63). Consequently, apoptosis is regulated under normal circumstances by the interaction of gene products that either induce or inhibit cell death and several gene products which modulate the apoptotic process have now been identified.

[0004] The most well-characterized apoptotic signaling cascade to date is that orchestrated by a family of cysteine proteases known as caspases. These enzymes activate apoptosis through proteolytic events triggered by one of several described mechanisms; including ligand binding to the cell surface death receptors of either the TNF or NGF receptor families, changes in mitochondrial integrity or chemical induction (Thornberry, British Medical Bulletin, 1997, 53, 478-490).

[0005] The proteins that control the other less understood caspase activation pathways often exist as families that can be recognized based on their amino acid sequence and/or structural similarity. Moreover, interactions among these proteins are commonly mediated by domains that are intimately associated with apoptosis regulation including death domains (DDs), death effector domains (DEDs), caspase-associated recruitment domains (CARDs) and nucleotide-binding oligomerization domains (NODs) (Inohara and Nunez, Oncogene, 2001, 20, 6473-6481; Reed, Am. J. Pathol., 2000, 157, 1415-1430).

[0006] The overall structure of the CARD is comprised of six alpha helices (Reed, Am. J. Pathol., 2000, 157, 1415-1430). Homotypic interactions among CARD-carrying proteins play important roles in caspase activation throughout evolution (Reed, Am. J. Pathol., 2000, 157, 1415-1430). Several pro-caspases contain N-terminal CARDs in their pro-domains including caspases 1, 2, 4, 5 and 9 in humans and caspases 1, 2, 9, 11 and 12 in mice (Reed, Am. J. Pathol., 2000, 157, 1415-1430). Additional CARD-carrying proteins involved in caspase activation include: CED-4, ApafI, NOD1, NOD2, Bcl-10 (huE10, CIPER) and RICK (Reed, Am. J. Pathol., 2000, 157, 1415-1430).

[0007] The NOD is another domain which plays a role in activation of diverse signaling pathways involved in the elimination of cells via programmed cell death and host defense against pathogens. It is found in several of the proteins which also contain a CARD, including Apaf-1, CED-4, CIITA, NAIP, DefCap, NALP2, NOD1 and NOD2 (Inohara and Nunez, Oncogene, 2001, 20, 6473-6481).

[0008] NOD1 (also known as caspase recruitment domain 4; CARD4) contains a CARD and a NOD and was cloned by two different research groups in 1999 (Bertin et al., J. Biol. Chem., 1999, 274, 12955-12958; Inohara et al., J. Biol. Chem., 1999, 274, 14560-14567). It is widely expressed as a 4.5 kb transcript with highest levels in heart, spleen and lung as well as in numerous cancer cell lines and fetal tissues (Bertin et al., J. Biol. Chem., 1999, 274, 12955-12958; Inohara et al., J. Biol. Chem., 1999, 274, 14560-14567). Inohara et al. reported that the NOD1 gene contains seven coding and seven non-coding exons and maps to chromosome 7p15-p14 (Inohara et al., J. Biol. Chem., 1999, 274, 14560-14567).

[0009] A nucleic acid sequence encoding NOD1 (CARD-4) is disclosed in U.S. Pat. No. 6,033,855 (Bertin, 2000). Disclosed and claimed in PCT publication WO 01/00826 are nucleic acid sequences encoding four different forms of NOD1, one long form called CARD4-L, one short form called CARD4-S and two splice variants called CARD4-Y and CARD4-Z and nucleic acid molecules which hybridize to said nucleic acid sequences or a complement thereof under stringent conditions (Bertin and Robison, 2001).

[0010] Additionally, a third splice variant has been identified and is herein designated CARD4-X.

[0011] NOD1 has been implicated as an activator of nuclear factor kappa-B and has been found to bind a number of caspases (Bertin et al., J. Biol. Chem., 1999, 274, 12955-12958; Inohara et al., J. Biol. Chem., 1999, 274, 14560-14567). It was found to specifically activate caspase-9 and promote caspase-9-induced apoptosis (Inohara et al., J. Biol. Chem., 1999, 274, 14560-14567).

[0012] More recently, NOD1 has been proposed to function as a cytosolic receptor for pathogen components derived from invading bacteria and mediates intracellular pathogen recognition and signal transduction (Girardin et al., EMBO Rep., 2001, 2, 736-742; Inohara et al., J. Biol. Chem., 2001, 276, 2551-2554).

[0013] Currently there exists a need to identify methods of modulating apoptosis for the therapeutic treatment of human diseases and it is believed that agents capable of modulating the expression of regulatory proteins and adaptor proteins involved in caspase apoptosis signaling cascades will be integral to these methods.

[0014] Currently, there are no known therapeutic agents that effectively inhibit the synthesis of NOD1. Consequently, there remains a long felt need for additional agents capable of effectively inhibiting NOD1 function.

[0015] Antisense technology is emerging as an effective means for reducing the expression of specific gene products and may therefore prove to be uniquely useful in a number of therapeutic, diagnostic, and research applications for the modulation of expression of NOD1.

[0016] The present invention provides compositions and methods for modulating expression of NOD1, including modulation of variants of NOD1.

SUMMARY OF THE INVENTION

[0017] The present invention is directed to compounds, particularly antisense oligonucleotides, which are targeted to a nucleic acid encoding NOD1, and which modulate the expression of NOD1. Pharmaceutical and other compositions comprising the compounds of the invention are also provided. Further provided are methods of modulating the expression of NOD1 in cells or tissues comprising contacting said cells or tissues with one or more of the antisense compounds or compositions of the invention. Further provided are methods of treating an animal, particularly a human, suspected of having or being prone to a disease or condition associated with expression of NOD1 by administering a therapeutically or prophylactically effective amount of one or more of the antisense compounds or compositions of the invention.

DETAILED DESCRIPTION OF THE INVENTION

[0018] The present invention employs oligomeric compounds, particularly antisense oligonucleotides, for use in modulating the function of nucleic acid molecules encoding NOD1, ultimately modulating the amount of NOD1 produced. This is accomplished by providing antisense compounds which specifically hybridize with one or more nucleic acids encoding NOD1. As used herein, the terms “target nucleic acid” and “nucleic acid encoding NOD1” encompass DNA encoding NOD1, RNA (including pre-mRNA and mRNA) transcribed from such DNA, and also cDNA derived from such RNA. The specific hybridization of an oligomeric compound with its target nucleic acid interferes with the normal function of the nucleic acid. This modulation of function of a target nucleic acid by compounds which specifically hybridize to it is generally referred to as “antisense”. The functions of DNA to be interfered with include replication and transcription. The functions of RNA to be interfered with include all vital functions such as, for example, translocation of the RNA to the site of protein translation, translation of protein from the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic activity which may be engaged in or facilitated by the RNA. The overall effect of such interference with target nucleic acid function is modulation of the expression of NOD1. In the context of the present invention, “modulation” means either an increase (stimulation) or a decrease (inhibition) in the expression of a gene. In the context of the present invention, inhibition is the preferred form of modulation of gene expression and mRNA is a preferred target.

[0019] It is preferred to target specific nucleic acids for antisense. “Targeting” an antisense compound to a particular nucleic acid, in the context of this invention, is a multistep process. The process usually begins with the identification of a nucleic acid sequence whose function is to be modulated. This may be, for example, a cellular gene (or mRNA transcribed from the gene) whose expression is associated with a particular disorder or disease state, or a nucleic acid molecule from an infectious agent. In the present invention, the target is a nucleic acid molecule encoding NOD1. The targeting process also includes determination of a site or sites within this gene for the antisense interaction to occur such that the desired effect, e.g., detection or modulation of expression of the protein, will result. Within the context of the present invention, a preferred intragenic site is the region encompassing the translation initiation or termination codon of the open reading frame (ORF) of the gene. Since, as is known in the art, the translation initiation codon is typically 5′-AUG (in transcribed mRNA molecules; 5′-ATG in the corresponding DNA molecule), the translation initiation codon is also referred to as the “AUG codon,” the “start codon” or the “AUG start codon”. A minority of genes have a translation initiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and 5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, the terms “translation initiation codon” and “start codon” can encompass many codon sequences, even though the initiator amino acid in each instance is typically methionine (in eukaryotes) or formylmethionine (in prokaryotes). It is also known in the art that eukaryotic and prokaryotic genes may have two or more alternative start codons, any one of which may be preferentially utilized for translation initiation in a particular cell type or tissue, or under a particular set of conditions. In the context of the invention, “start codon” and “translation initiation codon” refer to the codon or codons that are used in vivo to initiate translation of an mRNA molecule transcribed from a gene encoding NOD1, regardless of the sequence(s) of such codons.

[0020] It is also known in the art that a translation termination codon (or “stop codon”) of a gene may have one of three sequences, i.e., 5′-UAA, 5′-UAG and 5′-UGA (the corresponding DNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms “start codon region” and “translation initiation codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation initiation codon. Similarly, the terms “stop codon region” and “translation termination codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation termination codon.

[0021] The open reading frame (ORF) or “coding region,” which is known in the art to refer to the region between the translation initiation codon and the translation termination codon, is also a region which may be targeted effectively. Other target regions include the 5′ untranslated region (5′UTR), known in the art to refer to the portion of an mRNA in the 5′ direction from the translation initiation codon, and thus including nucleotides between the 5′ cap site and the translation initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3′ untranslated region (3′UTR), known in the art to refer to the portion of an mRNA in the 3′ direction from the translation termination codon, and thus including nucleotides between the translation termination codon and 3′ end of an mRNA or corresponding nucleotides on the gene. The 5′ cap of an mRNA comprises an N7-methylated guanosine residue joined to the 5′-most residue of the mRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA is considered to include the 5′ cap structure itself as well as the first 50 nucleotides adjacent to the cap. The 5′ cap region may also be a preferred target region.

[0022] Although some eukaryotic mRNA transcripts are directly translated, many contain one or more regions, known as “introns,” which are excised from a transcript before it is translated. The remaining (and therefore translated) regions are known as “exons” and are spliced together to form a continuous mRNA sequence. mRNA splice sites, i.e., intron-exon junctions, may also be preferred target regions, and are particularly useful in situations where aberrant splicing is implicated in disease, or where an overproduction of a particular mRNA splice product is implicated in disease. Aberrant fusion junctions due to rearrangements or deletions are also preferred targets. It has also been found that introns can also be effective, and therefore preferred, target regions for antisense compounds targeted, for example, to DNA or pre-mRNA.

[0023] It is also known in the art that alternative RNA transcripts can be produced from the same genomic region of DNA. These alternative transcripts are generally known as “variants”. More specifically, “pre-mRNA variants” are transcripts produced from the same genomic DNA that differ from other transcripts produced from the same genomic DNA in either their start or stop position and contain both intronic and extronic regions.

[0024] Upon excision of one or more exon or intron regions or portions thereof during splicing, pre-mRNA variants produce smaller “mRNA variants”. Consequently, mRNA variants are processed pre-mRNA variants and each unique pre-mRNA variant must always produce a unique mRNA variant as a result of splicing. These mRNA variants are also known as “alternative splice variants”. If no splicing of the pre-mRNA variant occurs then the pre-mRNA variant is identical to the mRNA variant.

[0025] It is also known in the art that variants can be produced through the use of alternative signals to start or stop transcription and that pre-mRNAs and mRNAs can possess more that one start codon or stop codon. Variants that originate from a pre-mRNA or mRNA that use alternative start codons are known as “alternative start variants” of that pre-mRNA or mRNA. Those transcripts that use an alternative stop codon are known as “alternative stop variants” of that pre-mRNA or mRNA. One specific type of alternative stop variant is the “polyA variant” in which the multiple transcripts produced result from the alternative selection of one of the “polyA stop signals” by the transcription machinery, thereby producing transcripts that terminate at unique polyA sites.

[0026] Once one or more target sites have been identified, oligonucleotides are chosen which are sufficiently complementary to the target, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired effect.

[0027] In the context of this invention, “hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. “Complementary,” as used herein, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an oligonucleotide is capable of hydrogen bonding with a nucleotide at the same position of a DNA or RNA molecule, then the oligonucleotide and the DNA or RNA are considered to be complementary to each other at that position. The oligonucleotide and the DNA or RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides which can hydrogen bond with each other. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the oligonucleotide and the DNA or RNA target. It is understood in the art that the sequence of an antisense compound need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable. An antisense compound is specifically hybridizable when binding of the compound to the target DNA or RNA molecule interferes with the normal function of the target DNA or RNA to cause a loss of utility, and there is a sufficient degree of complementarity to avoid non-specific binding of the antisense compound to non-target sequences under conditions in which specific binding is desired, i.e., under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed.

[0028] Antisense and other compounds of the invention which hybridize to the target and inhibit expression of the target are identified through experimentation, and the sequences of these compounds are hereinbelow identified as preferred embodiments of the invention. The target sites to which these preferred sequences are complementary are hereinbelow referred to as “active sites” and are therefore preferred sites for targeting. Therefore another embodiment of the invention encompasses compounds which hybridize to these active sites.

[0029] Antisense compounds are commonly used as research reagents and diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene expression with exquisite specificity, are often used by those of ordinary skill to elucidate the function of particular genes. Antisense compounds are also used, for example, to distinguish between functions of various members of a biological pathway. Antisense modulation has, therefore, been harnessed for research use.

[0030] For use in kits and diagnostics, the antisense compounds of the present invention, either alone or in combination with other antisense compounds or therapeutics, can be used as tools in differential and/or combinatorial analyses to elucidate expression patterns of a portion or the entire complement of genes expressed within cells and tissues.

[0031] Expression patterns within cells or tissues treated with one or more antisense compounds are compared to control cells or tissues not treated with antisense compounds and the patterns produced are analyzed for differential levels of gene expression as they pertain, for example, to disease association, signaling pathway, cellular localization, expression level, size, structure or function of the genes examined. These analyses can be performed on stimulated or unstimulated cells and in the presence or absence of other compounds which affect expression patterns.

[0032] Examples of methods of gene expression analysis known in the art include DNA arrays or microarrays (Brazma and Vilo, FEBS Lett., 2000, 480, 17-24; Celis, et al., FEBS Lett., 2000, 480, 2-16), SAGE (serial analysis of gene expression) (Madden, et al., Drug Discov. Today, 2000, 5, 415-425), READS (restriction enzyme amplification of digested cDNAs) (Prashar and Weissman, Methods Enzymol., 1999, 303, 258-72), TOGA (total gene expression analysis) (Sutcliffe, et al., Proc. Natl. Acad. Sci. U.S.A., 2000, 97, 1976-81), protein arrays and proteomics (Celis, et al., FEBS Lett., 2000, 480, 2-16; Jungblut, et al., Electrophoresis, 1999, 20, 2100-10), expressed sequence tag (EST) sequencing (Celis, et al., FEBS Lett., 2000, 480, 2-16; Larsson, et al., J. Biotechnol., 2000, 80, 143-57), subtractive RNA fingerprinting (SuRF) (Fuchs, et al., Anal. Biochem., 2000, 286, 91-98; Larson, et al., Cytometry, 2000, 41, 203-208), subtractive cloning, differential display (DD) (Jurecic and Belmont, Curr. Opin. Microbiol., 2000, 3, 316-21), comparative genomic hybridization (Carulli, et al., J. Cell Biochem. Suppl., 1998, 31, 286-96), FISH (fluorescent in situ hybridization) techniques (Going and Gusterson, Eur. J. Cancer, 1999, 35, 1895-904) and mass spectrometry methods (reviewed in (To, Comb. Chem. High Throughput Screen, 2000, 3, 235-41).

[0033] The specificity and sensitivity of antisense is also harnessed by those of skill in the art for therapeutic uses. Antisense oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals and man. Antisense oligonucleotide drugs, including ribozymes, have been safely and effectively administered to humans and numerous clinical trials are presently underway. It is thus established that oligonucleotides can be useful therapeutic modalities that can be configured to be useful in treatment regimes for treatment of cells, tissues and animals, especially humans.

[0034] In the context of this invention, the term “oligonucleotide” refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof. This term includes oligonucleotides composed of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.

[0035] While antisense oligonucleotides are a preferred form of antisense compound, the present invention comprehends other oligomeric antisense compounds, including but not limited to oligonucleotide mimetics such as are described below. The antisense compounds in accordance with this invention preferably comprise from about 8 to about 50 nucleobases (i.e. from about 8 to about 50 linked nucleosides). Particularly preferred antisense compounds are antisense oligonucleotides, even more preferably those comprising from about 12 to about 30 nucleobases. Antisense compounds include ribozymes, external guide sequence (EGS) oligonucleotides (oligozymes), and other short catalytic RNAs or catalytic oligonucleotides which hybridize to the target nucleic acid and modulate its expression.

[0036] As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to either the 2′, 3′ or 5′ hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn the respective ends of this linear polymeric structure can be further joined to form a circular structure, however, open linear structures are generally preferred. Within the oligonucleotide structure, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3′ to 5′ phosphodiester linkage.

[0037] Specific examples of preferred antisense compounds useful in this invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. As defined in this specification, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.

[0038] Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates, 5′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3′ to 3′, 5′ to 5′ or 2′ to 2′ linkage. Preferred oligonucleotides having inverted polarity comprise a single 3′ to 3′ linkage at the 3′-most internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included.

[0039] Representative United States patents that teach the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S.: U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0040] Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

[0041] Representative United States patents that teach the preparation of the above oligonucleosides include, but are not limited to, U.S.: U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 5,792,608; 5,646,269 and 5,677,439, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0042] In other preferred oligonucleotide mimetics, both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S.: U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al., Science, 1991, 254, 1497-1500.

[0043] Most preferred embodiments of the invention are oligonucleotides with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH₂—NH—O—CH₂—, —CH₂—N(CH₃)—O—CH₂— [known as a methylene (methylimino) or MMI backbone], —CH₂—O—N(CH₃)—CH₂—, —CH₂—N(CH₃)—N(CH₃)—CH₂— and —O—N(CH₃)—CH₂—CH₂— [wherein the native phosphodiester backbone is represented as —O—P—O—CH₂—] of the above referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.

[0044] Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred oligonucleotides comprise one of the following at the 2′ position: OH; F; O—, S—, or N-alkyl; O—, S—, or N-alkenyl; O—, S— or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly preferred are O[(CH₂)_(n)O]_(m)CH₃, O(CH₂)_(n)OCH₃, O(CH₂)_(n)NH₂, O(CH₂)_(n)CH₃, O(CH₂)_(n)ONH₂, and O(CH₂)_(n)ON[(CH₂)_(n)CH₃)]₂, where n and m are from 1 to about 10. Other preferred oligonucleotides comprise one of the following at the 2′ position: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A preferred modification includes 2′-methoxyethoxy (2′-O—CH₂CH₂OCH₃, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 2′-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂ group, also known as 2′-DMAOE, as described in examples hereinbelow, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, also described in examples hereinbelow.

[0045] A further prefered modification includes Locked Nucleic Acids (LNAS) in which the 2′-hydroxyl group is linked to the 3′ or 4′ carbon atom of the sugar ring thereby forming a bicyclic sugar moiety. The linkage is preferably a methelyne (—CH₂—)_(n) group bridging the 2′ oxygen atom and the 4′ carbon atom wherein n is 1 or 2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226.

[0046] Other preferred modifications include 2′-methoxy (2′-O—CH₃), 2′-aminopropoxy (2′-OCH₂CH₂CH₂NH₂), 2′-allyl (2′-CH₂—CH═CH₂), 2′-O-allyl (2′-O—CH₂—CH═CH₂) and 2′-fluoro (2′-F). The 2′-modification may be in the arabino (up) position or ribo (down) position. A preferred 2′-arabino modification is 2′-F. Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative United States patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S.: U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0047] Oligonucleotides may also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (—C≡C—CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine(1H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido[5,4-b][1,4]benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine (e.g. 9-(2-aminoethoxy)-H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido[4,5-b]indol-2-one), pyridoindole cytidine (H-pyrido[3′,2′:4,5]pyrrolo[2,3-d]pyrimidin-2-one). Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.

[0048] Representative United States patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. No. 3,687,808, as well as U.S.: U.S. Pat. Nos. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,645,985; 5,830,653; 5,763,588; 6,005,096; and 5,681,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference, and U.S. Pat. No. 5,750,692, which is commonly owned with the instant application and also herein incorporated by reference.

[0049] Another modification of the oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugates groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Representative conjugate groups are disclosed in International Patent Application PCT/US92/09196, filed Oct. 23, 1992 the entire disclosure of which is incorporated herein by reference. Conjugate moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937. Oligonucleotides of the invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, ibuprofen, suprofen, fenbufen, ketoprofen, (S)-(+)-pranoprofen, carprofen, dansylsarcosine, 2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. Oligonucleotide-drug conjugates and their preparation are described in U.S. patent application Ser. No. 09/334,130 (filed Jun. 15, 1999) which is incorporated herein by reference in its entirety.

[0050] Representative United States patents that teach the preparation of such oligonucleotide conjugates include, but are not limited to, U.S.: U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference.

[0051] It is not necessary for all positions in a given compound to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single compound or even at a single nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are chimeric compounds. “Chimeric” antisense compounds or “chimeras,” in the context of this invention, are antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, RNase H is a cellular endonuclease which cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.

[0052] Chimeric antisense compounds of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. Representative United States patents that teach the preparation of such hybrid structures include, but are not limited to, U.S.: U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0053] The antisense compounds used in accordance with this invention may be conveniently and routinely made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such synthesis known in the art may additionally or alternatively be employed. It is well known to use similar techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives.

[0054] The antisense compounds of the invention are synthesized in vitro and do not include antisense compositions of biological origin, or genetic vector constructs designed to direct the in vivo synthesis of antisense molecules. The compounds of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, molecule structures or mixtures of compounds, as for example, liposomes, receptor targeted molecules, oral, rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative United States patents that teach the preparation of such uptake, distribution and/or absorption assisting formulations include, but are not limited to, U.S.: U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein incorporated by reference.

[0055] The antisense compounds of the invention encompass any pharmaceutically acceptable salts, esters, or salts of such esters, or any other compound which, upon administration to an animal including a human, is capable of providing (directly or indirectly) the biologically active metabolite or residue thereof. Accordingly, for example, the disclosure is also drawn to prodrugs and pharmaceutically acceptable salts of the compounds of the invention, pharmaceutically acceptable salts of such prodrugs, and other bioequivalents.

[0056] The term “prodrug” indicates a therapeutic agent that is prepared in an inactive form that is converted to an active form (i.e., drug) within the body or cells thereof by the action of endogenous enzymes or other chemicals and/or conditions. In particular, prodrug versions of the oligonucleotides of the invention are prepared as SATE [(S-acetyl-2-thioethyl) phosphate] derivatives according to the methods disclosed in WO 93/24510 to Gosselin et al., published Dec. 9, 1993 or in WO 94/26764 and U.S. Pat. No. 5,770,713 to Imbach et al.

[0057] The term “pharmaceutically acceptable salts” refers to physiologically and pharmaceutically acceptable salts of the compounds of the invention: i.e., salts that retain the desired biological activity of the parent compound and do not impart undesired toxicological effects thereto.

[0058] Pharmaceutically acceptable base addition salts are formed with metals or amines, such as alkali and alkaline earth metals or organic amines. Examples of metals used as cations are sodium, potassium, magnesium, calcium, and the like. Examples of suitable amines are N,N′-dibenzylethylenediamine, chloroprocaine, choline, diethanolamine, dicyclohexylamine, ethylenediamine, N-methylglucamine, and procaine (see, for example, Berge et al., “Pharmaceutical Salts,” J. of Pharma Sci., 1977, 66, 1-19). The base addition salts of said acidic compounds are prepared by contacting the free acid form with a sufficient amount of the desired base to produce the salt in the conventional manner. The free acid form may be regenerated by contacting the salt form with an acid and isolating the free acid in the conventional manner. The free acid forms differ from their respective salt forms somewhat in certain physical properties such as solubility in polar solvents, but otherwise the salts are equivalent to their respective free acid for purposes of the present invention. As used herein, a “pharmaceutical addition salt” includes a pharmaceutically acceptable salt of an acid form of one of the components of the compositions of the invention. These include organic or inorganic acid salts of the amines. Preferred acid salts are the hydrochlorides, acetates, salicylates, nitrates and phosphates. Other suitable pharmaceutically acceptable salts are well known to those skilled in the art and include basic salts of a variety of inorganic and organic acids, such as, for example, with inorganic acids, such as for example hydrochloric acid, hydrobromic acid, sulfuric acid or phosphoric acid; with organic carboxylic, sulfonic, sulfo or phospho acids or N-substituted sulfamic acids, for example acetic acid, propionic acid, glycolic acid, succinic acid, maleic acid, hydroxymaleic acid, methylmaleic acid, fumaric acid, malic acid, tartaric acid, lactic acid, oxalic acid, gluconic acid, glucaric acid, glucuronic acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, salicylic acid, 4-aminosalicylic acid, 2-phenoxybenzoic acid, 2-acetoxybenzoic acid, embonic acid, nicotinic acid or isonicotinic acid; and with amino acids, such as the 20 alpha-amino acids involved in the synthesis of proteins in nature, for example glutamic acid or aspartic acid, and also with phenylacetic acid, methanesulfonic acid, ethanesulfonic acid, 2-hydroxyethanesulfonic acid, ethane-1,2-disulfonic acid, benzenesulfonic acid, 4-methylbenzenesulfonic acid, naphthalene-2-sulfonic acid, naphthalene-1,5-disulfonic acid, 2- or 3-phosphoglycerate, glucose-6-phosphate, N-cyclohexylsulfamic acid (with the formation of cyclamates), or with other acid organic compounds, such as ascorbic acid. Pharmaceutically acceptable salts of compounds may also be prepared with a pharmaceutically acceptable cation. Suitable pharmaceutically acceptable cations are well known to those skilled in the art and include alkaline, alkaline earth, ammonium and quaternary ammonium cations. Carbonates or hydrogen carbonates are also possible.

[0059] For oligonucleotides, preferred examples of pharmaceutically acceptable salts include but are not limited to (a) salts formed with cations such as sodium, potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; (b) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic acid, sulfuric acid, phosphoric acid, nitric acid and the like; (c) salts formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, naphthalenedisulfonic acid, polygalacturonic acid, and the like; and (d) salts formed from elemental anions such as chlorine, bromine, and iodine.

[0060] The antisense compounds of the present invention can be utilized for diagnostics, therapeutics, prophylaxis and as research reagents and kits. For therapeutics, an animal, preferably a human, suspected of having a disease or disorder which can be treated by modulating the expression of NOD1 is treated by administering antisense compounds in accordance with this invention. The compounds of the invention can be utilized in pharmaceutical compositions by adding an effective amount of an antisense compound to a suitable pharmaceutically acceptable diluent or carrier. Use of the antisense compounds and methods of the invention may also be useful prophylactically, e.g., to prevent or delay infection, inflammation or tumor formation, for example.

[0061] The antisense compounds of the invention are useful for research and diagnostics, because these compounds hybridize to nucleic acids encoding NOD1, enabling sandwich and other assays to easily be constructed to exploit this fact. Hybridization of the antisense oligonucleotides of the invention with a nucleic acid encoding NOD1 can be detected by means known in the art. Such means may include conjugation of an enzyme to the oligonucleotide, radiolabelling of the oligonucleotide or any other suitable detection means. Kits using such detection means for detecting the level of NOD1 in a sample may also be prepared.

[0062] The present invention also includes pharmaceutical compositions and formulations which include the antisense compounds of the invention. The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. Oligonucleotides with at least one 2′-O-methoxyethyl modification are believed to be particularly useful for oral administration.

[0063] Pharmaceutical compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Coated condoms, gloves and the like may also be useful. Preferred topical formulations include those in which the oligonucleotides of the invention are in admixture with a topical delivery agent such as lipids, liposomes, fatty acids, fatty acid esters, steroids, chelating agents and surfactants. Preferred lipids and liposomes include neutral (e.g. dioleoylphosphatidyl DOPE ethanolamine, dimyristoylphosphatidyl choline DMPC, distearolyphosphatidyl choline) negative (e.g. dimyristoylphosphatidyl glycerol DMPG) and cationic (e.g. dioleoyltetramethylaminopropyl DOTAP and dioleoylphosphatidyl ethanolamine DOTMA). Oligonucleotides of the invention may be encapsulated within liposomes or may form complexes thereto, in particular to cationic liposomes. Alternatively, oligonucleotides may be complexed to lipids, in particular to cationic lipids. Preferred fatty acids and esters include but are not limited arachidonic acid, oleic acid, eicosanoic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a C₁₋₁₀ alkyl ester (e.g. isopropylmyristate IPM), monoglyceride, diglyceride or pharmaceutically acceptable salt thereof. Topical formulations are described in detail in U.S. patent application Ser. No. 09/315,298 filed on May 20, 1999 which is incorporated herein by reference in its entirety.

[0064] Compositions and formulations for oral administration include powders or granules, microparticulates, nanoparticulates, suspensions or solutions in water or non-aqueous media, capsules, gel capsules, sachets, tablets or minitablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable. Preferred oral formulations are those in which oligonucleotides of the invention are administered in conjunction with one or more penetration enhancers surfactants and chelators. Preferred surfactants include fatty acids and/or esters or salts thereof, bile acids and/or salts thereof. Prefered bile acids/salts include chenodeoxycholic acid (CDCA) and ursodeoxychenodeoxycholic acid (UDCA), cholic acid, dehydrocholic acid, deoxycholic acid, glucholic acid, glycholic acid, glycodeoxycholic acid, taurocholic acid, taurodeoxycholic acid, sodium tauro-24,25-dihydro-fusidate, sodium glycodihydrofusidate. Prefered fatty acids include arachidonic acid, undecanoic acid, oleic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a monoglyceride, a diglyceride or a pharmaceutically acceptable salt thereof (e.g. sodium). Also prefered are combinations of penetration enhancers, for example, fatty acids/salts in combination with bile acids/salts. A particularly prefered combination is the sodium salt of lauric acid, capric acid and UDCA. Further penetration enhancers include polyoxyethylene-9-lauryl ether, polyoxyethylene-20-cetyl ether. Oligonucleotides of the invention may be delivered orally in granular form including sprayed dried particles, or complexed to form micro or nanoparticles. Oligonucleotide complexing agents include poly-amino acids; polyimines; polyacrylates; polyalkylacrylates, polyoxethanes, polyalkylcyanoacrylates; cationized gelatins, albumins, starches, acrylates, polyethyleneglycols (PEG) and starches; polyalkylcyanoacrylates; DEAE-derivatized polyimines, pollulans, celluloses and starches. Particularly preferred complexing agents include chitosan, N-trimethylchitosan, poly-L-lysine, polyhistidine, polyornithine, polyspermines, protamine, polyvinylpyridine, polythiodiethylamino-methylethylene P(TDAE), polyaminostyrene (e.g. p-amino), poly(methylcyanoacrylate), poly(ethylcyanoacrylate), poly(butylcyanoacrylate), poly(isobutylcyanoacrylate), poly(isohexylcynaoacrylate), DEAE-methacrylate, DEAE-hexylacrylate, DEAE-acrylamide, DEAE-albumin and DEAE-dextran, polymethylacrylate, polyhexylacrylate, poly(D,L-lactic acid), poly(DL-lactic-co-glycolic acid (PLGA), alginate, and polyethyleneglycol (PEG). Oral formulations for oligonucleotides and their preparation are described in detail in U.S. application Ser. No. 08/886,829 (filed Jul. 1, 1997), Ser. No. 09/108,673 (filed Jul. 1, 1998), Ser. No. 09/256,515 (filed Feb. 23, 1999), Ser. No. 09/082,624 (filed May 21, 1998) and Ser. No. 09/315,298 (filed May 20, 1999) each of which is incorporated herein by reference in their entirety.

[0065] Compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.

[0066] Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.

[0067] The pharmaceutical formulations of the present invention, which may conveniently be presented in unit dosage form, may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.

[0068] The compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0069] In one embodiment of the present invention the pharmaceutical compositions may be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product. The preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.

[0070] Emulsions

[0071] The compositions of the present invention may be prepared and formulated as emulsions. Emulsions are typically heterogenous systems of one liquid dispersed in another in the form of droplets usually exceeding 0.1 μm in diameter. (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199; Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., Volume 1, p. 245; Block in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 2, p. 335; Higuchi et al., in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 301). Emulsions are often biphasic systems comprising of two immiscible liquid phases intimately mixed and dispersed with each other. In general, emulsions may be either water-in-oil (w/o) or of the oil-in-water (o/w) variety. When an aqueous phase is finely divided into and dispersed as minute droplets into a bulk oily phase the resulting composition is called a water-in-oil (w/o) emulsion. Alternatively, when an oily phase is finely divided into and dispersed as minute droplets into a bulk aqueous phase the resulting composition is called an oil-in-water (o/w) emulsion. Emulsions may contain additional components in addition to the dispersed phases and the active drug which may be present as a solution in either the aqueous phase, oily phase or itself as a separate phase. Pharmaceutical excipients such as emulsifiers, stabilizers, dyes, and anti-oxidants may also be present in emulsions as needed. Pharmaceutical emulsions may also be multiple emulsions that are comprised of more than two phases such as, for example, in the case of oil-in-water-in-oil (o/w/o) and water-in-oil-in-water (w/o/w) emulsions. Such complex formulations often provide certain advantages that simple binary emulsions do not. Multiple emulsions in which individual oil droplets of an o/w emulsion enclose small water droplets constitute a w/o/w emulsion. Likewise a system of oil droplets enclosed in globules of water stabilized in an oily continuous provides an o/w/o emulsion.

[0072] Emulsions are characterized by little or no thermodynamic stability. Often, the dispersed or discontinuous phase of the emulsion is well dispersed into the external or continuous phase and maintained in this form through the means of emulsifiers or the viscosity of the formulation. Either of the phases of the emulsion may be a semisolid or a solid, as is the case of emulsion-style ointment bases and creams. Other means of stabilizing emulsions entail the use of emulsifiers that may be incorporated into either phase of the emulsion. Emulsifiers may broadly be classified into four categories: synthetic surfactants, naturally occurring emulsifiers, absorption bases, and finely dispersed solids (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0073] Synthetic surfactants, also known as surface active agents, have found wide applicability in the formulation of emulsions and have been reviewed in the literature (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), Marcel Dekker, Inc., New York, N.Y., 1988, volume 1, p. 199). Surfactants are typically amphiphilic and comprise a hydrophilic and a hydrophobic portion. The ratio of the hydrophilic to the hydrophobic nature of the surfactant has been termed the hydrophile/lipophile balance (HLB) and is a valuable tool in categorizing and selecting surfactants in the preparation of formulations. Surfactants may be classified into different classes based on the nature of the hydrophilic group: nonionic, anionic, cationic and amphoteric (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285).

[0074] Naturally occurring emulsifiers used in emulsion formulations include lanolin, beeswax, phosphatides, lecithin and acacia. Absorption bases possess hydrophilic properties such that they can soak up water to form w/o emulsions yet retain their semisolid consistencies, such as anhydrous lanolin and hydrophilic petrolatum. Finely divided solids have also been used as good emulsifiers especially in combination with surfactants and in viscous preparations. These include polar inorganic solids, such as heavy metal hydroxides, nonswelling clays such as bentonite, attapulgite, hectorite, kaolin, montmorillonite, colloidal aluminum silicate and colloidal magnesium aluminum silicate, pigments and nonpolar solids such as carbon or glyceryl tristearate.

[0075] A large variety of non-emulsifying materials are also included in emulsion formulations and contribute to the properties of emulsions. These include fats, oils, waxes, fatty acids, fatty alcohols, fatty esters, humectants, hydrophilic colloids, preservatives and antioxidants (Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0076] Hydrophilic colloids or hydrocolloids include naturally occurring gums and synthetic polymers such as polysaccharides (for example, acacia, agar, alginic acid, carrageenan, guar gum, karaya gum, and tragacanth), cellulose derivatives (for example, carboxymethylcellulose and carboxypropylcellulose), and synthetic polymers (for example, carbomers, cellulose ethers, and carboxyvinyl polymers). These disperse or swell in water to form colloidal solutions that stabilize emulsions by forming strong interfacial films around the dispersed-phase droplets and by increasing the viscosity of the external phase.

[0077] Since emulsions often contain a number of ingredients such as carbohydrates, proteins, sterols and phosphatides that may readily support the growth of microbes, these formulations often incorporate preservatives. Commonly used preservatives included in emulsion formulations include methyl paraben, propyl paraben, quaternary ammonium salts, benzalkonium chloride, esters of p-hydroxybenzoic acid, and boric acid. Antioxidants are also commonly added to emulsion formulations to prevent deterioration of the formulation. Antioxidants used may be free radical scavengers such as tocopherols, alkyl gallates, butylated hydroxyanisole, butylated hydroxytoluene, or reducing agents such as ascorbic acid and sodium metabisulfite, and antioxidant synergists such as citric acid, tartaric acid, and lecithin.

[0078] The application of emulsion formulations via dermatological, oral and parenteral routes and methods for their manufacture have been reviewed in the literature (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Emulsion formulations for oral delivery have been very widely used because of reasons of ease of formulation, efficacy from an absorption and bioavailability standpoint. (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Mineral-oil base laxatives, oil-soluble vitamins and high fat nutritive preparations are among the materials that have commonly been administered orally as o/w emulsions.

[0079] In one embodiment of the present invention, the compositions of oligonucleotides and nucleic acids are formulated as microemulsions. A microemulsion may be defined as a system of water, oil and amphiphile which is a single optically isotropic and thermodynamically stable liquid solution (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Typically microemulsions are systems that are prepared by first dispersing an oil in an aqueous surfactant solution and then adding a sufficient amount of a fourth component, generally an intermediate chain-length alcohol to form a transparent system. Therefore, microemulsions have also been described as thermodynamically stable, isotropically clear dispersions of two immiscible liquids that are stabilized by interfacial films of surface-active molecules (Leung and Shah, in: Controlled Release of Drugs: Polymers and Aggregate Systems, Rosoff, M., Ed., 1989, VCH Publishers, New York, pages 185-215). Microemulsions commonly are prepared via a combination of three to five components that include oil, water, surfactant, cosurfactant and electrolyte. Whether the microemulsion is of the water-in-oil (w/o) or an oil-in-water (o/w) type is dependent on the properties of the oil and surfactant used and on the structure and geometric packing of the polar heads and hydrocarbon tails of the surfactant molecules (Schott, in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 271).

[0080] The phenomenological approach utilizing phase diagrams has been extensively studied and has yielded a comprehensive knowledge, to one skilled in the art, of how to formulate microemulsions (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335). Compared to conventional emulsions, microemulsions offer the advantage of solubilizing water-insoluble drugs in a formulation of thermodynamically stable droplets that are formed spontaneously.

[0081] Surfactants used in the preparation of microemulsions include, but are not limited to, ionic surfactants, non-ionic surfactants, Brij 96, polyoxyethylene oleyl ethers, polyglycerol fatty acid esters, tetraglycerol monolaurate (ML310), tetraglycerol monooleate (MO310), hexaglycerol monooleate (PO310), hexaglycerol pentaoleate (PO500), decaglycerol monocaprate (MCA750), decaglycerol monooleate (MO750), decaglycerol sequioleate (SO750), decaglycerol decaoleate (DA0750), alone or in combination with cosurfactants. The cosurfactant, usually a short-chain alcohol such as ethanol, 1-propanol, and 1-butanol, serves to increase the interfacial fluidity by penetrating into the surfactant film and consequently creating a disordered film because of the void space generated among surfactant molecules. Microemulsions may, however, be prepared without the use of cosurfactants and alcohol-free self-emulsifying microemulsion systems are known in the art. The aqueous phase may typically be, but is not limited to, water, an aqueous solution of the drug, glycerol, PEG300, PEG400, polyglycerols, propylene glycols, and derivatives of ethylene glycol. The oil phase may include, but is not limited to, materials such as Captex 300, Captex 355, Capmul MCM, fatty acid esters, medium chain (C₈-C₁₂) mono, di, and tri-glycerides, polyoxyethylated glyceryl fatty acid esters, fatty alcohols, polyglycolized glycerides, saturated polyglycolized C₈-C₁₀ glycerides, vegetable oils and silicone oil.

[0082] Microemulsions are particularly of interest from the standpoint of drug solubilization and the enhanced absorption of drugs. Lipid based microemulsions (both o/w and w/o) have been proposed to enhance the oral bioavailability of drugs, including peptides (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385-1390; Ritschel, Meth. Find. Exp. Clin. Pharmacol., 1993, 13, 205). Microemulsions afford advantages of improved drug solubilization, protection of drug from enzymatic hydrolysis, possible enhancement of drug absorption due to surfactant-induced alterations in membrane fluidity and permeability, ease of preparation, ease of oral administration over solid dosage forms, improved clinical potency, and decreased toxicity (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385; Ho et al., J. Pharm. Sci., 1996, 85, 138-143). Often microemulsions may form spontaneously when their components are brought together at ambient temperature. This may be particularly advantageous when formulating thermolabile drugs, peptides or oligonucleotides. Microemulsions have also been effective in the transdermal delivery of active components in both cosmetic and pharmaceutical applications. It is expected that the microemulsion compositions and formulations of the present invention will facilitate the increased systemic absorption of oligonucleotides and nucleic acids from the gastrointestinal tract, as well as improve the local cellular uptake of oligonucleotides and nucleic acids within the gastrointestinal tract, vagina, buccal cavity and other areas of administration.

[0083] Microemulsions of the present invention may also contain additional components and additives such as sorbitan monostearate (Grill 3), Labrasol, and penetration enhancers to improve the properties of the formulation and to enhance the absorption of the oligonucleotides and nucleic acids of the present invention. Penetration enhancers used in the microemulsions of the present invention may be classified as belonging to one of five broad categories—surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92). Each of these classes has been discussed above.

[0084] Liposomes

[0085] There are many organized surfactant structures besides microemulsions that have been studied and used for the formulation of drugs. These include monolayers, micelles, bilayers and vesicles. Vesicles, such as liposomes, have attracted great interest because of their specificity and the duration of action they offer from the standpoint of drug delivery. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a spherical bilayer or bilayers.

[0086] Liposomes are unilamellar or multilamellar vesicles which have a membrane formed from a lipophilic material and an aqueous interior. The aqueous portion contains the composition to be delivered. Cationic liposomes possess the advantage of being able to fuse to the cell wall. Non-cationic liposomes, although not able to fuse as efficiently with the cell wall, are taken up by macrophages in vivo.

[0087] In order to cross intact mammalian skin, lipid vesicles must pass through a series of fine pores, each with a diameter less than 50 nm, under the influence of a suitable transdermal gradient. Therefore, it is desirable to use a liposome which is highly deformable and able to pass through such fine pores.

[0088] Further advantages of liposomes include; liposomes obtained from natural phospholipids are biocompatible and biodegradable; liposomes can incorporate a wide range of water and lipid soluble drugs; liposomes can protect encapsulated drugs in their internal compartments from metabolism and degradation (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Important considerations in the preparation of liposome formulations are the lipid surface charge, vesicle size and the aqueous volume of the liposomes.

[0089] Liposomes are useful for the transfer and delivery of active ingredients to the site of action. Because the liposomal membrane is structurally similar to biological membranes, when liposomes are applied to a tissue, the liposomes start to merge with the cellular membranes. As the merging of the liposome and cell progresses, the liposomal contents are emptied into the cell where the active agent may act.

[0090] Liposomal formulations have been the focus of extensive investigation as the mode of delivery for many drugs. There is growing evidence that for topical administration, liposomes present several advantages over other formulations. Such advantages include reduced side-effects related to high systemic absorption of the administered drug, increased accumulation of the administered drug at the desired target, and the ability to administer a wide variety of drugs, both hydrophilic and hydrophobic, into the skin.

[0091] Several reports have detailed the ability of liposomes to deliver agents including high-molecular weight DNA into the skin. Compounds including analgesics, antibodies, hormones and high-molecular weight DNAs have been administered to the skin. The majority of applications resulted in the targeting of the upper epidermis.

[0092] Liposomes fall into two broad classes. Cationic liposomes are positively charged liposomes which interact with the negatively charged DNA molecules to form a stable complex. The positively charged DNA/liposome complex binds to the negatively charged cell surface and is internalized in an endosome. Due to the acidic pH within the endosome, the liposomes are ruptured, releasing their contents into the cell cytoplasm (Wang et al., Biochem. Biophys. Res. Commun., 1987, 147, 980-985).

[0093] Liposomes which are pH-sensitive or negatively-charged, entrap DNA rather than complex with it. Since both the DNA and the lipid are similarly charged, repulsion rather than complex formation occurs. Nevertheless, some DNA is entrapped within the aqueous interior of these liposomes. pH-sensitive liposomes have been used to deliver DNA encoding the thymidine kinase gene to cell monolayers in culture. Expression of the exogenous gene was detected in the target cells (Zhou et al., Journal of Controlled Release, 1992, 19, 269-274).

[0094] One major type of liposomal composition includes phospholipids other than naturally-derived phosphatidylcholine. Neutral liposome compositions, for example, can be formed from dimyristoyl phosphatidylcholine (DMPC) or dipalmitoyl phosphatidylcholine (DPPC). Anionic liposome compositions generally are formed from dimyristoyl phosphatidylglycerol, while anionic fusogenic liposomes are formed primarily from dioleoyl phosphatidylethanolamine (DOPE). Another type of liposomal composition is formed from phosphatidylcholine (PC) such as, for example, soybean PC, and egg PC. Another type is formed from mixtures of phospholipid and/or phosphatidylcholine and/or cholesterol.

[0095] Several studies have assessed the topical delivery of liposomal drug formulations to the skin. Application of liposomes containing interferon to guinea pig skin resulted in a reduction of skin herpes sores while delivery of interferon via other means (e.g. as a solution or as an emulsion) were ineffective (Weiner et al., Journal of Drug Targeting, 1992, 2, 405-410). Further, an additional study tested the efficacy of interferon administered as part of a liposomal formulation to the administration of interferon using an aqueous system, and concluded that the liposomal formulation was superior to aqueous administration (du Plessis et al., Antiviral Research, 1992, 18, 259-265).

[0096] Non-ionic liposomal systems have also been examined to determine their utility in the delivery of drugs to the skin, in particular systems comprising non-ionic surfactant and cholesterol. Non-ionic liposomal formulations comprising Novasome™ I (glyceryl dilaurate/cholesterol/polyoxyethylene-10-stearyl ether) and Novasome™ II (glyceryl distearate/cholesterol/polyoxyethylene-10-stearyl ether) were used to deliver cyclosporin-A into the dermis of mouse skin. Results indicated that such non-ionic liposomal systems were effective in facilitating the deposition of cyclosporin-A into different layers of the skin (Hu et al. S. T. P. Pharma. Sci., 1994, 4, 6, 466).

[0097] Liposomes also include “sterically stabilized” liposomes, a term which, as used herein, refers to liposomes comprising one or more specialized lipids that, when incorporated into liposomes, result in enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome (A) comprises one or more glycolipids, such as monosialoganglioside G_(M1), or (B) is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. While not wishing to be bound by any particular theory, it is thought in the art that, at least for sterically stabilized liposomes containing gangliosides, sphingomyelin, or PEG-derivatized lipids, the enhanced circulation half-life of these sterically stabilized liposomes derives from a reduced uptake into cells of the reticuloendothelial system (RES) (Allen et al., FEBS Letters, 1987, 223, 42; Wu et al., Cancer Research, 1993, 53, 3765).

[0098] Various liposomes comprising one or more glycolipids are known in the art. Papahadjopoulos et al. (Ann. N.Y. Acad. Sci., 1987, 507, 64) reported the ability of monosialoganglioside G_(M1), galactocerebroside sulfate and phosphatidylinositol to improve blood half-lives of liposomes. These findings were expounded upon by Gabizon et al. (Proc. Natl. Acad. Sci. U.S.A., 1988, 85, 6949). U.S. Pat. No. 4,837,028 and WO 88/04924, both to Allen et al., disclose liposomes comprising (1) sphingomyelin and (2) the ganglioside G_(M1) or a galactocerebroside sulfate ester. U.S. Pat. No. 5,543,152 (Webb et al.) discloses liposomes comprising sphingomyelin. Liposomes comprising 1,2-sn-dimyristoylphosphatidylcholine are disclosed in WO 97/13499 (Lim et al.).

[0099] Many liposomes comprising lipids derivatized with one or more hydrophilic polymers, and methods of preparation thereof, are known in the art. Sunamoto et al. (Bull. Chem. Soc. Jpn., 1980, 53, 2778) described liposomes comprising a nonionic detergent, 2C₁₂15G, that contains a PEG moiety. Illum et al. (FEBS Lett., 1984, 167, 79) noted that hydrophilic coating of polystyrene particles with polymeric glycols results in significantly enhanced blood half-lives. Synthetic phospholipids modified by the attachment of carboxylic groups of polyalkylene glycols (e.g., PEG) are described by Sears (U.S. Pat. Nos. 4,426,330 and 4,534,899). Klibanov et al. (FEBS Lett., 1990, 268, 235) described experiments demonstrating that liposomes comprising phosphatidylethanolamine (PE) derivatized with PEG or PEG stearate have significant increases in blood circulation half-lives. Blume et al. (Biochimica et Biophysica Acta, 1990, 1029, 91) extended such observations to other PEG-derivatized phospholipids, e.g., DSPE-PEG, formed from the combination of distearoylphosphatidylethanolamine (DSPE) and PEG. Liposomes having covalently bound PEG moieties on their external surface are described in European Patent No. EP 0 445 131 B1 and WO 90/04384 to Fisher. Liposome compositions containing 1-20 mole percent of PE derivatized with PEG, and methods of use thereof, are described by Woodle et al. (U.S. Pat. Nos. 5,013,556 and 5,356,633) and Martin et al. (U.S. Pat. No. 5,213,804 and European Patent No. EP 0 496 813 B1). Liposomes comprising a number of other lipid-polymer conjugates are disclosed in WO 91/05545 and U.S. Pat. No. 5,225,212 (both to Martin et al.) and in WO 94/20073 (Zalipsky et al.) Liposomes comprising PEG-modified ceramide lipids are described in WO 96/10391 (Choi et al.). U.S. Pat. Nos. 5,540,935 (Miyazaki et al.) and 5,556,948 (Tagawa et al.) describe PEG-containing liposomes that can be further derivatized with functional moieties on their surfaces.

[0100] A limited number of liposomes comprising nucleic acids are known in the art. WO 96/40062 to Thierry et al. discloses methods for encapsulating high molecular weight nucleic acids in liposomes. U.S. Pat. No. 5,264,221 to Tagawa et al. discloses protein-bonded liposomes and asserts that the contents of such liposomes may include an antisense RNA. U.S. Pat. No. 5,665,710 to Rahman et al. describes certain methods of encapsulating oligodeoxynucleotides in liposomes. WO 97/04787 to Love et al. discloses liposomes comprising antisense oligonucleotides targeted to the raf gene.

[0101] Transfersomes are yet another type of liposomes, and are highly deformable lipid aggregates which are attractive candidates for drug delivery vehicles. Transfersomes may be described as lipid droplets which are so highly deformable that they are easily able to penetrate through pores which are smaller than the droplet. Transfersomes are adaptable to the environment in which they are used, e.g. they are self-optimizing (adaptive to the shape of pores in the skin), self-repairing, frequently reach their targets without fragmenting, and often self-loading. To make transfersomes it is possible to add surface edge-activators, usually surfactants, to a standard liposomal composition. Transfersomes have been used to deliver serum albumin to the skin. The transfersome-mediated delivery of serum albumin has been shown to be as effective as subcutaneous injection of a solution containing serum albumin.

[0102] Surfactants find wide application in formulations such as emulsions (including microemulsions) and liposomes. The most common way of classifying and ranking the properties of the many different types of surfactants, both natural and synthetic, is by the use of the hydrophile/lipophile balance (HLB). The nature of the hydrophilic group (also known as the “head”) provides the most useful means for categorizing the different surfactants used in formulations (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0103] If the surfactant molecule is not ionized, it is classified as a nonionic surfactant. Nonionic surfactants find wide application in pharmaceutical and cosmetic products and are usable over a wide range of pH values. In general their HLB values range from 2 to about 18 depending on their structure. Nonionic surfactants include nonionic esters such as ethylene glycol esters, propylene glycol esters, glyceryl esters, polyglyceryl esters, sorbitan esters, sucrose esters, and ethoxylated esters. Nonionic alkanolamides and ethers such as fatty alcohol ethoxylates, propoxylated alcohols, and ethoxylated/propoxylated block polymers are also included in this class. The polyoxyethylene surfactants are the most popular members of the nonionic surfactant class.

[0104] If the surfactant molecule carries a negative charge when it is dissolved or dispersed in water, the surfactant is classified as anionic. Anionic surfactants include carboxylates such as soaps, acyl lactylates, acyl amides of amino acids, esters of sulfuric acid such as alkyl sulfates and ethoxylated alkyl sulfates, sulfonates such as alkyl benzene sulfonates, acyl isethionates, acyl taurates and sulfosuccinates, and phosphates. The most important members of the anionic surfactant class are the alkyl sulfates and the soaps.

[0105] If the surfactant molecule carries a positive charge when it is dissolved or dispersed in water, the surfactant is classified as cationic. Cationic surfactants include quaternary ammonium salts and ethoxylated amines. The quaternary ammonium salts are the most used members of this class.

[0106] If the surfactant molecule has the ability to carry either a positive or negative charge, the surfactant is classified as amphoteric. Amphoteric surfactants include acrylic acid derivatives, substituted alkylamides, N-alkylbetaines and phosphatides.

[0107] The use of surfactants in drug products, formulations and in emulsions has been reviewed (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0108] Penetration Enhancers

[0109] In one embodiment, the present invention employs various penetration enhancers to effect the efficient delivery of nucleic acids, particularly oligonucleotides, to the skin of animals. Most drugs are present in solution in both ionized and nonionized forms. However, usually only lipid soluble or lipophilic drugs readily cross cell membranes. It has been discovered that even non-lipophilic drugs may cross cell membranes if the membrane to be crossed is treated with a penetration enhancer. In addition to aiding the diffusion of non-lipophilic drugs across cell membranes, penetration enhancers also enhance the permeability of lipophilic drugs.

[0110] Penetration enhancers may be classified as belonging to one of five broad categories, i.e., surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92). Each of the above mentioned classes of penetration enhancers are described below in greater detail.

[0111] Surfactants: In connection with the present invention, surfactants (or “surface-active agents”) are chemical entities which, when dissolved in an aqueous solution, reduce the surface tension of the solution or the interfacial tension between the aqueous solution and another liquid, with the result that absorption of oligonucleotides through the mucosa is enhanced. In addition to bile salts and fatty acids, these penetration enhancers include, for example, sodium lauryl sulfate, polyoxyethylene-9-lauryl ether and polyoxyethylene-20-cetyl ether) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92); and perfluorochemical emulsions, such as FC-43. Takahashi et al., J. Pharm. Pharmacol., 1988, 40, 252).

[0112] Fatty acids: Various fatty acids and their derivatives which act as penetration enhancers include, for example, oleic acid, lauric acid, capric acid (n-decanoic acid), myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein (1-monooleoyl-rac-glycerol), dilaurin, caprylic acid, arachidonic acid, glycerol 1-monocaprate, 1-dodecylazacycloheptan-2-one, acylcarnitines, acylcholines, C₁₋₁₀ alkyl esters thereof (e.g., methyl, isopropyl and t-butyl), and mono- and di-glycerides thereof (i.e., oleate, laurate, caprate, myristate, palmitate, stearate, linoleate, etc.) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; El Hariri et al., J. Pharm. Pharmacol., 1992, 44, 651-654).

[0113] Bile salts: The physiological role of bile includes the facilitation of dispersion and absorption of lipids and fat-soluble vitamins (Brunton, Chapter 38 in: Goodman & Gilman's The Pharmacological Basis of Therapeutics, 9th Ed., Hardman et al. Eds., McGraw-Hill, New York, 1996, pp. 934-935). Various natural bile salts, and their synthetic derivatives, act as penetration enhancers. Thus the term “bile salts” includes any of the naturally occurring components of bile as well as any of their synthetic derivatives. The bile salts of the invention include, for example, cholic acid (or its pharmaceutically acceptable sodium salt, sodium cholate), dehydrocholic acid (sodium dehydrocholate), deoxycholic acid (sodium deoxycholate), glucholic acid (sodium glucholate), glycholic acid (sodium glycocholate), glycodeoxycholic acid (sodium glycodeoxycholate), taurocholic acid (sodium taurocholate), taurodeoxycholic acid (sodium taurodeoxycholate), chenodeoxycholic acid (sodium chenodeoxycholate), ursodeoxycholic acid (UDCA), sodium tauro-24,25-dihydro-fusidate (STDHF), sodium glycodihydrofusidate and polyoxyethylene-9-lauryl ether (POE) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Swinyard, Chapter 39 In: Remington's Pharmaceutical Sciences, 18th Ed., Gennaro, ed., Mack Publishing Co., Easton, Pa., 1990, pages 782-783; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Yamamoto et al., J. Pharm. Exp. Ther., 1992, 263, 25; Yamashita et al., J. Pharm. Sci., 1990, 79, 579-583).

[0114] Chelating Agents: Chelating agents, as used in connection with the present invention, can be defined as compounds that remove metallic ions from solution by forming complexes therewith, with the result that absorption of oligonucleotides through the mucosa is enhanced. With regards to their use as penetration enhancers in the present invention, chelating agents have the added advantage of also serving as DNase inhibitors, as most characterized DNA nucleases require a divalent metal ion for catalysis and are thus inhibited by chelating agents (Jarrett, J. Chromatogr., 1993, 618, 315-339). Chelating agents of the invention include but are not limited to disodium ethylenediaminetetraacetate (EDTA), citric acid, salicylates (e.g., sodium salicylate, 5-methoxysalicylate and homovanilate), N-acyl derivatives of collagen, laureth-9 and N-amino acyl derivatives of beta-diketones (enamines)(Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Buur et al., J. Control Rel., 1990, 14, 43-51).

[0115] Non-chelating non-surfactants: As used herein, non-chelating non-surfactant penetration enhancing compounds can be defined as compounds that demonstrate insignificant activity as chelating agents or as surfactants but that nonetheless enhance absorption of oligonucleotides through the alimentary mucosa (Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33). This class of penetration enhancers include, for example, unsaturated cyclic ureas, 1-alkyl- and 1-alkenylazacyclo-alkanone derivatives (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92); and non-steroidal anti-inflammatory agents such as diclofenac sodium, indomethacin and phenylbutazone (Yamashita et al., J. Pharm. Pharmacol., 1987, 39, 621-626).

[0116] Agents that enhance uptake of oligonucleotides at the cellular level may also be added to the pharmaceutical and other compositions of the present invention. For example, cationic lipids, such as lipofectin (Junichi et al, U.S. Pat. No. 5,705,188), cationic glycerol derivatives, and polycationic molecules, such as polylysine (Lollo et al., PCT Application WO 97/30731), are also known to enhance the cellular uptake of oligonucleotides.

[0117] Other agents may be utilized to enhance the penetration of the administered nucleic acids, including glycols such as ethylene glycol and propylene glycol, pyrrols such as 2-pyrrol, azones, and terpenes such as limonene and menthone.

[0118] Carriers

[0119] Certain compositions of the present invention also incorporate carrier compounds in the formulation. As used herein, “carrier compound” or “carrier” can refer to a nucleic acid, or analog thereof, which is inert (i.e., does not possess biological activity per se) but is recognized as a nucleic acid by in vivo processes that reduce the bioavailability of a nucleic acid having biological activity by, for example, degrading the biologically active nucleic acid or promoting its removal from circulation. The coadministration of a nucleic acid and a carrier compound, typically with an excess of the latter substance, can result in a substantial reduction of the amount of nucleic acid recovered in the liver, kidney or other extracirculatory reservoirs, presumably due to competition between the carrier compound and the nucleic acid for a common receptor. For example, the recovery of a partially phosphorothioate oligonucleotide in hepatic tissue can be reduced when it is coadministered with polyinosinic acid, dextran sulfate, polycytidic acid or 4-acetamido-4′isothiocyano-stilbene-2,2′-disulfonic acid (Miyao et al., Antisense Res. Dev., 1995, 5, 115-121; Takakura et al., Antisense & Nucl. Acid Drug Dev., 1996, 6, 177-183).

[0120] Excipients

[0121] In contrast to a carrier compound, a “pharmaceutical carrier” or “excipient” is a pharmaceutically acceptable solvent, suspending agent or any other pharmacologically inert vehicle for delivering one or more nucleic acids to an animal. The excipient may be liquid or solid and is selected, with the planned manner of administration in mind, so as to provide for the desired bulk, consistency, etc., when combined with a nucleic acid and the other components of a given pharmaceutical composition. Typical pharmaceutical carriers include, but are not limited to, binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose, etc.); fillers (e.g., lactose and other sugars, microcrystalline cellulose, pectin, gelatin, calcium sulfate, ethyl cellulose, polyacrylates or calcium hydrogen phosphate, etc.); lubricants (e.g., magnesium stearate, talc, silica, colloidal silicon dioxide, stearic acid, metallic stearates, hydrogenated vegetable oils, corn starch, polyethylene glycols, sodium benzoate, sodium acetate, etc.); disintegrants (e.g., starch, sodium starch glycolate, etc.); and wetting agents (e.g., sodium lauryl sulphate, etc.).

[0122] Pharmaceutically acceptable organic or inorganic excipient suitable for non-parenteral administration which do not deleteriously react with nucleic acids can also be used to formulate the compositions of the present invention. Suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0123] Formulations for topical administration of nucleic acids may include sterile and non-sterile aqueous solutions, non-aqueous solutions in common solvents such as alcohols, or solutions of the nucleic acids in liquid or solid oil bases. The solutions may also contain buffers, diluents and other suitable additives. Pharmaceutically acceptable organic or inorganic excipients suitable for non-parenteral administration which do not deleteriously react with nucleic acids can be used.

[0124] Suitable pharmaceutically acceptable excipients include, but are not limited to, water, salt solutions, alcohol, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0125] Other Components

[0126] The compositions of the present invention may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels. Thus, for example, the compositions may contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.

[0127] Aqueous suspensions may contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0128] Certain embodiments of the invention provide pharmaceutical compositions containing (a) one or more antisense compounds and (b) one or more other chemotherapeutic agents which function by a non-antisense mechanism. Examples of such chemotherapeutic agents include but are not limited to daunorubicin, daunomycin, dactinomycin, doxorubicin, epirubicin, idarubicin, esorubicin, bleomycin, mafosfamide, ifosfamide, cytosine arabinoside, bis-chloroethylnitrosurea, busulfan, mitomycin C, actinomycin D, mithramycin, prednisone, hydroxyprogesterone, testosterone, tamoxifen, dacarbazine, procarbazine, hexamethylmelamine, pentamethylmelamine, mitoxantrone, amsacrine, chlorambucil, methylcyclohexylnitrosurea, nitrogen mustards, melphalan, cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-azacytidine, hydroxyurea, deoxycoformycin, 4-hydroxyperoxycyclophosphoramide, 5-fluorouracil (5-FU), 5-fluorodeoxyuridine (5-FUdR), methotrexate (MTX), colchicine, taxol, vincristine, vinblastine, etoposide (VP-16), trimetrexate, irinotecan, topotecan, gemcitabine, teniposide, cisplatin and diethylstilbestrol (DES). See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed. 1987, pp. 1206-1228, Berkow et al., eds., Rahway, N.J. When used with the compounds of the invention, such chemotherapeutic agents may be used individually (e.g., 5-FU and oligonucleotide), sequentially (e.g., 5-FU and oligonucleotide for a period of time followed by MTX and oligonucleotide), or in combination with one or more other such chemotherapeutic agents (e.g., 5-FU, MTX and oligonucleotide, or 5-FU, radiotherapy and oligonucleotide). Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, vidarabine, acyclovir and ganciclovir, may also be combined in compositions of the invention. See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed., Berkow et al., eds., 1987, Rahway, N.J., pages 2499-2506 and 46-49, respectively). Other non-antisense chemotherapeutic agents are also within the scope of this invention. Two or more combined compounds may be used together or sequentially.

[0129] In another related embodiment, compositions of the invention may contain one or more antisense compounds, particularly oligonucleotides, targeted to a first nucleic acid and one or more additional antisense compounds targeted to a second nucleic acid target. Numerous examples of antisense compounds are known in the art. Two or more combined compounds may be used together or sequentially.

[0130] The formulation of therapeutic compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC₅₀ s found to be effective in in vitro and in vivo animal models. In general, dosage is from 0.01 ug to 100 g per kg of body weight, and may be given once or more daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 100 g per kg of body weight, once or more daily, to once every 20 years.

[0131] While the present invention has been described with specificity in accordance with certain of its preferred embodiments, the following examples serve only to illustrate the invention and are not intended to limit the same.

EXAMPLES Example 1

[0132] Nucleoside Phosphoramidites for Oligonucleotide Synthesis Deoxy and 2′-alkoxy Amidites

[0133] 2′-Deoxy and 2′-methoxy beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial sources (e.g. Chemgenes, Needham Mass. or Glen Research, Inc. Sterling Va.). Other 2′-O-alkoxy substituted nucleoside amidites are prepared as described in U.S. Pat. No. 5,506,351, herein incorporated by reference. For oligonucleotides synthesized using 2′-alkoxy amidites, the standard cycle for unmodified oligonucleotides was utilized, except the wait step after pulse delivery of tetrazole and base was increased to 360 seconds.

[0134] Oligonucleotides containing 5-methyl-2′-deoxycytidine (5-Me-C) nucleotides were synthesized according to published methods [Sanghvi, et. al., Nucleic Acids Research, 1993, 21, 3197-3203] using commercially available phosphoramidites (Glen Research, Sterling Va. or ChemGenes, Needham Mass.).

[0135] 2′-Fluoro Amidites

[0136] 2′-Fluorodeoxyadenosine Amidites

[0137] 2′-fluoro oligonucleotides were synthesized as described previously [Kawasaki, et. al., J. Med. Chem., 1993, 36, 831-841] and U.S. Pat. No. 5,670,633, herein incorporated by reference. Briefly, the protected nucleoside N6-benzoyl-2′-deoxy-2′-fluoroadenosine was synthesized utilizing commercially available 9-beta-D-arabinofuranosyladenine as starting material and by modifying literature procedures whereby the 2′-alpha-fluoro atom is introduced by a S_(N)2-displacement of a 2′-beta-trityl group. Thus N6-benzoyl-9-beta-D-arabinofuranosyladenine was selectively protected in moderate yield as the 3′,5′-ditetrahydropyranyl (THP) intermediate. Deprotection of the THP and N6-benzoyl groups was accomplished using standard methodologies and standard methods were used to obtain the 5′-dimethoxytrityl-(DMT) and 5′-DMT-3′-phosphoramidite intermediates.

[0138] 2′-Fluorodeoxyguanosine

[0139] The synthesis of 2′-deoxy-2′-fluoroguanosine was accomplished using tetraisopropyldisiloxanyl (TPDS) protected 9-beta-D-arabinofuranosylguanine as starting material, and conversion to the intermediate diisobutyryl-arabinofuranosylguanosine. Deprotection of the TPDS group was followed by protection of the hydroxyl group with THP to give diisobutyryl di-THP protected arabinofuranosylguanine. Selective O-deacylation and triflation was followed by treatment of the crude product with fluoride, then deprotection of the THP groups. Standard methodologies were used to obtain the 5′-DMT- and 5′-DMT-3′-phosphoramidites.

[0140] 2′-Fluorouridine

[0141] Synthesis of 2′-deoxy-2′-fluorouridine was accomplished by the modification of a literature procedure in which 2,2′-anhydro-1-beta-D-arabinofuranosyluracil was treated with 70% hydrogen fluoride-pyridine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0142] 2′-Fluorodeoxycytidine

[0143] 2′-deoxy-2′-fluorocytidine was synthesized via amination of 2′-deoxy-2′-fluorouridine, followed by selective protection to give N4-benzoyl-2′-deoxy-2′-fluorocytidine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0144] 2′-O-(2-Methoxyethyl) Modified Amidites

[0145] 2′-O-Methoxyethyl-substituted nucleoside amidites are prepared as follows, or alternatively, as per the methods of Martin, P., Helvetica Chimica Acta, 1995, 78, 486-504.

[0146] 2,2′-Anhydro[1-(beta-D-arabinofuranosyl)-5-methyluridine]

[0147] 5-Methyluridine (ribosylthymine, commercially available through Yamasa, Choshi, Japan) (72.0 g, 0.279 M), diphenyl-carbonate (90.0 g, 0.420 M) and sodium bicarbonate (2.0 g, 0.024 M) were added to DMF (300 mL). The mixture was heated to reflux, with stirring, allowing the evolved carbon dioxide gas to be released in a controlled manner. After 1 hour, the slightly darkened solution was concentrated under reduced pressure. The resulting syrup was poured into diethylether (2.5 L), with stirring. The product formed a gum. The ether was decanted and the residue was dissolved in a minimum amount of methanol (ca. 400 mL). The solution was poured into fresh ether (2.5 L) to yield a stiff gum. The ether was decanted and the gum was dried in a vacuum oven (60° C. at 1 mm Hg for 24 h) to give a solid that was crushed to a light tan powder (57 g, 85% crude yield). The NMR spectrum was consistent with the structure, contaminated with phenol as its sodium salt (ca. 5%). The material was used as is for further reactions (or it can be purified further by column chromatography using a gradient of methanol in ethyl acetate (10-25%) to give a white solid, mp 222-4° C.).

[0148] 2′-O-Methoxyethyl-5-methyluridine

[0149] 2,2′-Anhydro-5-methyluridine (195 g, 0.81 M), tris(2-methoxyethyl)borate (231 g, 0.98 M) and 2-methoxyethanol (1.2 L) were added to a 2 L stainless steel pressure vessel and placed in a pre-heated oil bath at 160° C. After heating for 48 hours at 155-160° C., the vessel was opened and the solution evaporated to dryness and triturated with MeOH (200 mL). The residue was suspended in hot acetone (1 L). The insoluble salts were filtered, washed with acetone (150 mL) and the filtrate evaporated. The residue (280 g) was dissolved in CH₃CN (600 mL) and evaporated. A silica gel column (3 kg) was packed in CH₂Cl₂/acetone/MeOH (20:5:3) containing 0.5% Et₃NH. The residue was dissolved in CH₂Cl₂ (250 mL) and adsorbed onto silica (150 g) prior to loading onto the column. The product was eluted with the packing solvent to give 160 g (63%) of product. Additional material was obtained by reworking impure fractions.

[0150] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0151] 2′-O-Methoxyethyl-5-methyluridine (160 g, 0.506 M) was co-evaporated with pyridine (250 mL) and the dried residue dissolved in pyridine (1.3 L). A first aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the mixture stirred at room temperature for one hour. A second aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the reaction stirred for an additional one hour. Methanol (170 mL) was then added to stop the reaction. HPLC showed the presence of approximately 70% product. The solvent was evaporated and triturated with CH₃CN (200 mL). The residue was dissolved in CHCl₃ (1.5 L) and extracted with 2×500 mL of saturated NaHCO₃ and 2×500 mL of saturated NaCl. The organic phase was dried over Na₂SO₄, filtered and evaporated. 275 g of residue was obtained. The residue was purified on a 3.5 kg silica gel column, packed and eluted with EtOAc/hexane/acetone (5:5:1) containing 0.5% Et₃NH. The pure fractions were evaporated to give 164 g of product. Approximately 20 g additional was obtained from the impure fractions to give a total yield of 183 g (57%).

[0152] 3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0153] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (106 g, 0.167 M), DMF/pyridine (750 mL of a 3:1 mixture prepared from 562 mL of DMF and 188 mL of pyridine) and acetic anhydride (24.38 mL, 0.258 M) were combined and stirred at room temperature for 24 hours. The reaction was monitored by TLC by first quenching the TLC sample with the addition of MeOH. Upon completion of the reaction, as judged by TLC, MeOH (50 mL) was added and the mixture evaporated at 35° C. The residue was dissolved in CHCl₃ (800 mL) and extracted with 2×200 mL of saturated sodium bicarbonate and 2×200 mL of saturated NaCl. The water layers were back extracted with 200 mL of CHCl₃. The combined organics were dried with sodium sulfate and evaporated to give 122 g of residue (approx. 90% product). The residue was purified on a 3.5 kg silica gel column and eluted using EtOAc/hexane(4:1). Pure product fractions were evaporated to yield 96 g (84%). An additional 1.5 g was recovered from later fractions.

[0154] 3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine

[0155] A first solution was prepared by dissolving 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (96 g, 0.144 M) in CH₃CN (700 mL) and set aside. Triethylamine (189 mL, 1.44 M) was added to a solution of triazole (90 g, 1.3 M) in CH₃CN (1 L), cooled to −5° C. and stirred for 0.5 h using an overhead stirrer. POCl₃ was added dropwise, over a 30 minute period, to the stirred solution maintained at 0-10° C., and the resulting mixture stirred for an additional 2 hours. The first solution was added dropwise, over a 45 minute period, to the latter solution. The resulting reaction mixture was stored overnight in a cold room. Salts were filtered from the reaction mixture and the solution was evaporated. The residue was dissolved in EtOAc (1 L) and the insoluble solids were removed by filtration. The filtrate was washed with 1×300 mL of NaHCO₃ and 2×300 mL of saturated NaCl, dried over sodium sulfate and evaporated. The residue was triturated with EtOAc to give the title compound.

[0156] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0157] A solution of 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine (103 g, 0.141 M) in dioxane (500 mL) and NH₄OH (30 mL) was stirred at room temperature for 2 hours. The dioxane solution was evaporated and the residue azeotroped with MeOH (2×200 mL). The residue was dissolved in MeOH (300 mL) and transferred to a 2 liter stainless steel pressure vessel. MeOH (400 mL) saturated with NH₃ gas was added and the vessel heated to 100° C. for 2 hours (TLC showed complete conversion). The vessel contents were evaporated to dryness and the residue was dissolved in EtOAc (500 mL) and washed once with saturated NaCl (200 mL). The organics were dried over sodium sulfate and the solvent was evaporated to give 85 g (95%) of the title compound.

[0158] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0159] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (85 g, 0.134 M) was dissolved in DMF (800 mL) and benzoic anhydride (37.2 g, 0.165 M) was added with stirring. After stirring for 3 hours, TLC showed the reaction to be approximately 95% complete. The solvent was evaporated and the residue azeotroped with MeOH (200 mL). The residue was dissolved in CHCl₃ (700 mL) and extracted with saturated NaHCO₃ (2×300 mL) and saturated NaCl (2×300 mL), dried over MgSO₄ and evaporated to give a residue (96 g). The residue was chromatographed on a 1.5 kg silica column using EtOAc/hexane (1:1) containing 0.5% Et₃NH as the eluting solvent. The pure product fractions were evaporated to give 90 g (90%) of the title compound.

[0160] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine-3′-amidite

[0161] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (74 g, 0.10 M) was dissolved in CH₂Cl₂ (1 L). Tetrazole diisopropylamine (7.1 g) and 2-cyanoethoxy-tetra-(isopropyl)phosphite (40.5 mL, 0.123 M) were added with stirring, under a nitrogen atmosphere. The resulting mixture was stirred for 20 hours at room temperature (TLC showed the reaction to be 95% complete). The reaction mixture was extracted with saturated NaHCO₃ (1×300 mL) and saturated NaCl (3×300 mL). The aqueous washes were back-extracted with CH₂Cl₂ (300 mL), and the extracts were combined, dried over MgSO₄ and concentrated. The residue obtained was chromatographed on a 1.5 kg silica column using EtOAc/hexane (3:1) as the eluting solvent. The pure fractions were combined to give 90.6 g (87%) of the title compound.

[0162] 2′-O-(Aminooxyethyl) Nucleoside Amidites and 2′-O-(dimethylaminooxyethyl) Nucleoside Amidites

[0163] 2′-(Dimethylaminooxyethoxy) Nucleoside Amidites

[0164] 2′-(Dimethylaminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(dimethylaminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and guanosine nucleoside amidites are prepared similarly to the thymidine (5-methyluridine) except the exocyclic amines are protected with a benzoyl moiety in the case of adenosine and cytidine and with isobutyryl in the case of guanosine.

[0165] 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine

[0166] O²-2′-anhydro-5-methyluridine (Pro. Bio. Sint., Varese, Italy, 100.0 g, 0.416 mmol), dimethylaminopyridine (0.66 g, 0.013 eq, 0.0054 mmol) were dissolved in dry pyridine (500 ml) at ambient temperature under an argon atmosphere and with mechanical stirring. tert-Butyldiphenylchlorosilane (125.8 g, 119.0 mL, 1.1 eq, 0.458 mmol) was added in one portion. The reaction was stirred for 16 h at ambient temperature. TLC (Rf 0.22, ethyl acetate) indicated a complete reaction. The solution was concentrated under reduced pressure to a thick oil. This was partitioned between dichloromethane (1 L) and saturated sodium bicarbonate (2×1 L) and brine (1 L). The organic layer was dried over sodium sulfate and concentrated under reduced pressure to a thick oil. The oil was dissolved in a 1:1 mixture of ethyl acetate and ethyl ether (600 ml) and the solution was cooled to

[0167] −10° C. The resulting crystalline product was collected by filtration, washed with ethyl ether (3×200 mL) and dried (40° C., 1 mm Hg, 24 h) to 149 g (74.8%) of white solid. TLC and NMR were consistent with pure product.

[0168] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine

[0169] In a 2 L stainless steel, unstirred pressure reactor was added borane in tetrahydrofuran (1.0 M, 2.0 eq, 622 mL). In the fume hood and with manual stirring, ethylene glycol (350 mL, excess) was added cautiously at first until the evolution of hydrogen gas subsided. 5′-O-tert-Butyldiphenylsilyl 2-2′-anhydro-5-methyluridine (149 g, 0.311 mol) and sodium bicarbonate (0.074 g, 0.003 eq) were added with manual stirring. The reactor was sealed and heated in an oil bath until an internal temperature of 160° C. was reached and then maintained for 16 h (pressure <100 psig). The reaction vessel was cooled to ambient and opened. TLC (Rf 0.67 for desired product and Rf 0.82 for ara-T side product, ethyl acetate) indicated about 70% conversion to the product. In order to avoid additional side product formation, the reaction was stopped, concentrated under reduced pressure (10 to 1 mm Hg) in a warm water bath (40-100° C.) with the more extreme conditions used to remove the ethylene glycol. [Alternatively, once the low boiling solvent is gone, the remaining solution can be partitioned between ethyl acetate and water. The product will be in the organic phase.] The residue was purified by column chromatography (2 kg silica gel, ethyl acetate-hexanes gradient 1:1 to 4:1). The appropriate fractions were combined, stripped and dried to product as a white crisp foam (84 g, 50%), contaminated starting material (17.4 g) and pure reusable starting material 20 g. The yield based on starting material less pure recovered starting material was 58%. TLC and NMR were consistent with 99% pure product.

[0170] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine

[0171] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine (20 g, 36.98 mmol) was mixed with triphenylphosphine (11.63 g, 44.36 mmol) and N-hydroxyphthalimide (7.24 g, 44.36 mmol). It was then dried over P₂O₅ under high vacuum for two days at 40° C. The reaction mixture was flushed with argon and dry THF (369.8 mL, Aldrich, sure seal bottle) was added to get a clear solution. Diethyl-azodicarboxylate (6.98 mL, 44.36 mmol) was added dropwise to the reaction mixture. The rate of addition is maintained such that resulting deep red coloration is just discharged before adding the next drop. After the addition was complete, the reaction was stirred for 4 hrs. By that time TLC showed the completion of the reaction (ethylacetate:hexane, 60:40). The solvent was evaporated in vacuum. Residue obtained was placed on a flash column and eluted with ethyl acetate:hexane (60:40), to get 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine as white foam (21.819 g, 86%).

[0172] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine

[0173] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine (3.1 g, 4.5 mmol) was dissolved in dry CH₂Cl₂ (4.5 mL) and methylhydrazine (300 mL, 4.64 mmol) was added dropwise at −10° C. to 0° C. After 1 h the mixture was filtered, the filtrate was washed with ice cold CH₂Cl₂ and the combined organic phase was washed with water, brine and dried over anhydrous Na₂SO₄. The solution was concentrated to get 2′-O-(aminooxyethyl) thymidine, which was then dissolved in MeOH (67.5 mL). To this formaldehyde (20% aqueous solution, w/w, 1.1 eq.) was added and the resulting mixture was strirred for 1 h. Solvent was removed under vacuum; residue chromatographed to get 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy) ethyl]-5-methyluridine as white foam (1.95 g, 78%).

[0174] 5′-O-tert-Butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine

[0175] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine (1.77 g, 3.12 mmol) was dissolved in a solution of 1M pyridinium p-toluenesulfonate (PPTS) in dry MeOH (30.6 mL). Sodium cyanoborohydride (0.39 g, 6.13 mmol) was added to this solution at 10° C. under inert atmosphere. The reaction mixture was stirred for 10 minutes at 10° C. After that the reaction vessel was removed from the ice bath and stirred at room temperature for 2 h, the reaction monitored by TLC (5% MeOH in CH₂Cl₂). Aqueous NaHCO₃ solution (5%, 10 mL) was added and extracted with ethyl acetate (2×20 mL). Ethyl acetate phase was dried over anhydrous Na₂SO₄, evaporated to dryness. Residue was dissolved in a solution of 1M PPTS in MeOH (30.6 mL). Formaldehyde (20% w/w, 30 mL, 3.37 mmol) was added and the reaction mixture was stirred at room temperature for 10 minutes. Reaction mixture cooled to 10° C. in an ice bath, sodium cyanoborohydride (0.39 g, 6.13 mmol) was added and reaction mixture stirred at 10° C. for 10 minutes. After 10 minutes, the reaction mixture was removed from the ice bath and stirred at room temperature for 2 hrs. To the reaction mixture 5% NaHCO₃ (25 mL) solution was added and extracted with ethyl acetate (2×25 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and evaporated to dryness. The residue obtained was purified by flash column chromatography and eluted with 5% MeOH in CH₂Cl₂ to get 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine as a white foam (14.6 g, 80%).

[0176] 2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0177] Triethylamine trihydrofluoride (3.91 mL, 24.0 mmol) was dissolved in dry THF and triethylamine (1.67 mL, 12 mmol, dry, kept over KOH). This mixture of triethylamine-2HF was then added to 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine (1.40 g, 2.4 mmol) and stirred at room temperature for 24 hrs. Reaction was monitored by TLC (5% MeOH in CH₂Cl₂). Solvent was removed under vacuum and the residue placed on a flash column and eluted with 10% MeOH in CH₂Cl₂ to get 2′-O-(dimethylaminooxyethyl)-5-methyluridine (766 mg, 92.5%).

[0178] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0179] 2′-O-(dimethylaminooxyethyl)-5-methyluridine (750 mg, 2.17 mmol) was dried over P₂O₅ under high vacuum overnight at 40° C. It was then co-evaporated with anhydrous pyridine (20 mL). The residue obtained was dissolved in pyridine (11 mL) under argon atmosphere. 4-dimethylaminopyridine (26.5 mg, 2.60 mmol), 4,4′-dimethoxytrityl chloride (880 mg, 2.60 mmol) was added to the mixture and the reaction mixture was stirred at room temperature until all of the starting material disappeared. Pyridine was removed under vacuum and the residue chromatographed and eluted with 10% MeOH in CH₂Cl₂ (containing a few drops of pyridine) to get 5′-O-DMT-2′-0-(dimethylamino-oxyethyl)-5-methyluridine (1.13 g, 80%).

[0180] 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0181] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine (1.08 g, 1.67 mmol) was co-evaporated with toluene (20 mL). To the residue N,N-diisopropylamine tetrazonide (0.29 g, 1.67 mmol) was added and dried over P₂O₅ under high vacuum overnight at 40° C. Then the reaction mixture was dissolved in anhydrous acetonitrile (8.4 mL) and 2-cyanoethyl-N,N,N′,N′-tetraisopropylphosphoramidite (2.12 mL, 6.08 mmol) was added. The reaction mixture was stirred at ambient temperature for 4 hrs under inert atmosphere. The progress of the reaction was monitored by TLC (hexane:ethyl acetate 1:1). The solvent was evaporated, then the residue was dissolved in ethyl acetate (70 mL) and washed with 5% aqueous NaHCO₃ (40 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and concentrated. Residue obtained was chromatographed (ethyl acetate as eluent) to get 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite] as a foam (1.04 g, 74.9%).

[0182] 2′-(Aminooxyethoxy) Nucleoside Amidites

[0183] 2′-(Aminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(aminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and thymidine nucleoside amidites are prepared similarly.

[0184] N2-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0185] The 2′-O-aminooxyethyl guanosine analog may be obtained by selective 2′-O-alkylation of diaminopurine riboside. Multigram quantities of diaminopurine riboside may be purchased from Schering AG (Berlin) to provide 2′-O-(2-ethylacetyl) diaminopurine riboside along with a minor amount of the 3′-O-isomer. 2′-O-(2-ethylacetyl) diaminopurine riboside may be resolved and converted to 2′-O-(2-ethylacetyl)guanosine by treatment with adenosine deaminase. (McGee, D. P. C., Cook, P. D., Guinosso, C. J., WO 94/02501 Al 940203.) Standard protection procedures should afford 2′-O-(2-ethylacetyl)-5′-o-(4,4′-dimethoxytrityl)guanosine and 2-N-isobutyryl-6-o-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine which may be reduced to provide 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-hydroxyethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine. As before the hydroxyl group may be displaced by N-hydroxyphthalimide via a mitsunobu reaction, and the protected nucleoside may phosphitylated as usual to yield 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-([2-phthalmidoxy]ethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite].

[0186] 2′-dimethylaminoethoxyethoxy (2′-DMAEOE) Nucleoside Amidites

[0187] 2′-dimethylaminoethoxyethoxy nucleoside amidites (also known in the art as 2′-O-dimethylaminoethoxyethyl, i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, or 2′-DMAEOE nucleoside amidites) are prepared as follows. Other nucleoside amidites are prepared similarly.

[0188] 2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl]-5-methyl Uridine

[0189] 2[2-(Dimethylamino)ethoxy]ethanol (Aldrich, 6.66 g, 50 mmol) is slowly added to a solution of borane in tetrahydrofuran (1 M, 10 mL, 10 mmol) with stirring in a 100 mL bomb. Hydrogen gas evolves as the solid dissolves. O²,2′-anhydro-5-methyluridine (1.2 g, 5 mmol), and sodium bicarbonate (2.5 mg) are added and the bomb is sealed, placed in an oil bath and heated to 155° C. for 26 hours. The bomb is cooled to room temperature and opened. The crude solution is concentrated and the residue partitioned between water (200 mL) and hexanes (200 mL). The excess phenol is extracted into the hexane layer. The aqueous layer is extracted with ethyl acetate (3×200 mL) and the combined organic layers are washed once with water, dried over anhydrous sodium sulfate and concentrated. The residue is columned on silica gel using methanol/methylene chloride 1:20 (which has 2% triethylamine) as the eluent. As the column fractions are concentrated a colorless solid forms which is collected to give the title compound as a white solid.

[0190] 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy) ethyl)]-5-methyl Uridine

[0191] To 0.5 g (1.3 mmol) of 2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl)]-5-methyl uridine in anhydrous pyridine (8 mL), triethylamine (0.36 mL) and dimethoxytrityl chloride (DMT-Cl, 0.87 g, 2 eq.) are added and stirred for 1 hour. The reaction mixture is poured into water (200 mL) and extracted with CH₂Cl₂ (2×200 mL). The combined CH₂Cl₂ layers are washed with saturated NaHCO₃ solution, followed by saturated NaCl solution and dried over anhydrous sodium sulfate. Evaporation of the solvent followed by silica gel chromatography using MeOH:CH₂Cl₂:Et₃N (20:1, v/v, with 1% triethylamine) gives the title compound.

[0192] 5′-O-Dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)-ethyl)]-5-methyl uridine-3′-O-(cyanoethyl-N,N-diisopropyl)phosphoramidite

[0193] Diisopropylaminotetrazolide (0.6 g) and 2-cyanoethoxy-N,N-diisopropyl phosphoramidite (1.1 mL, 2 eq.) are added to a solution of 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl)]-5-methyluridine (2.17 g, 3 mmol) dissolved in CH₂Cl₂ (20 mL) under an atmosphere of argon. The reaction mixture is stirred overnight and the solvent evaporated. The resulting residue is purified by silica gel flash column chromatography with ethyl acetate as the eluent to give the title compound.

Example 2

[0194] Oligonucleotide Synthesis

[0195] Unsubstituted and substituted phosphodiester (P═O) oligonucleotides are synthesized on an automated DNA synthesizer (Applied Biosystems model 380B) using standard phosphoramidite chemistry with oxidation by iodine.

[0196] Phosphorothioates (P═S) are synthesized as for the phosphodiester oligonucleotides except the standard oxidation bottle was replaced by 0.2 M solution of 3H-1,2-benzodithiole-3-one 1,1-dioxide in acetonitrile for the stepwise thiation of the phosphite linkages. The thiation wait step was increased to 68 sec and was followed by the capping step. After cleavage from the CPG column and deblocking in concentrated ammonium hydroxide at 55° C. (18 h), the oligonucleotides were purified by precipitating twice with 2.5 volumes of ethanol from a 0.5 M NaCl solution.

[0197] Phosphinate oligonucleotides are prepared as described in U.S. Pat. No. 5,508,270, herein incorporated by reference.

[0198] Alkyl phosphonate oligonucleotides are prepared as described in U.S. Pat. No. 4,469,863, herein incorporated by reference.

[0199] 3′-Deoxy-3′-methylene phosphonate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050, herein incorporated by reference.

[0200] Phosphoramidite oligonucleotides are prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 5,366,878, herein incorporated by reference.

[0201] Alkylphosphonothioate oligonucleotides are prepared as described in published PCT applications PCT/US94/00902 and PCT/US93/06976 (published as WO 94/17093 and WO 94/02499, respectively), herein incorporated by reference.

[0202] 3′-Deoxy-3′-amino phosphoramidate oligonucleotides are prepared as described in U.S. Pat. No. 5,476,925, herein incorporated by reference.

[0203] Phosphotriester oligonucleotides are prepared as described in U.S. Pat. No. 5,023,243, herein incorporated by reference.

[0204] Borano phosphate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198, both herein incorporated by reference.

Example 3

[0205] Oligonucleoside Synthesis

[0206] Methylenemethylimino linked oligonucleosides, also identified as MMI linked oligonucleosides, methylenedimethylhydrazo linked oligonucleosides, also identified as MDH linked oligonucleosides, and methylenecarbonylamino linked oligonucleosides, also identified as amide-3 linked oligonucleosides, and methyleneaminocarbonyl linked oligonucleosides, also identified as amide-4 linked oligonucleosides, as well as mixed backbone compounds having, for instance, alternating MMI and P═O or P═S linkages are prepared as described in U.S. Pat. Nos. 5,378,825, 5,386,023, 5,489,677, 5,602,240 and 5,610,289, all of which are herein incorporated by reference.

[0207] Formacetal and thioformacetal linked oligonucleosides are prepared as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, herein incorporated by reference.

[0208] Ethylene oxide linked oligonucleosides are prepared as described in U.S. Pat. No. 5,223,618, herein incorporated by reference.

Example 4

[0209] PNA Synthesis

[0210] Peptide nucleic acids (PNAs) are prepared in accordance with any of the various procedures referred to in Peptide Nucleic Acids (PNA): Synthesis, Properties and Potential Applications, Bioorganic & Medicinal Chemistry, 1996, 4, 5-23. They may also be prepared in accordance with U.S. Pat. Nos. 5,539,082, 5,700,922, and 5,719,262, herein incorporated by reference.

Example 5

[0211] Synthesis of Chimeric Oligonucleotides

[0212] Chimeric oligonucleotides, oligonucleosides or mixed oligonucleotides/oligonucleosides of the invention can be of several different types. These include a first type wherein the “gap” segment of linked nucleosides is positioned between 5′ and 3′ “wing” segments of linked nucleosides and a second “open end” type wherein the “gap” segment is located at either the 3′ or the 5′ terminus of the oligomeric compound. Oligonucleotides of the first type are also known in the art as “gapmers” or gapped oligonucleotides. Oligonucleotides of the second type are also known in the art as “hemimers” or “wingmers”.

[0213] [2′-O-Me]-[2′-deoxy]-[2′-O-Me] Chimeric Phosphorothioate Oligonucleotides

[0214] Chimeric oligonucleotides having 2′-O-alkyl phosphorothioate and 2′-deoxy phosphorothioate oligonucleotide segments are synthesized using an Applied Biosystems automated DNA synthesizer Model 380B, as above. Oligonucleotides are synthesized using the automated synthesizer and 2′-deoxy-5′-dimethoxytrityl-3′-O-phosphoramidite for the DNA portion and 5′-dimethoxytrityl-2′-O-methyl-3′-O-phosphoramidite for 51 and 3′ wings. The standard synthesis cycle is modified by increasing the wait step after the delivery of tetrazole and base to 600 s repeated four times for RNA and twice for 2′-O-methyl. The fully protected oligonucleotide is cleaved from the support and the phosphate group is deprotected in 3:1 ammonia/ethanol at room temperature overnight then lyophilized to dryness. Treatment in methanolic ammonia for 24 hrs at room temperature is then done to deprotect all bases and sample was again lyophilized to dryness. The pellet is resuspended in 1M TBAF in THF for 24 hrs at room temperature to deprotect the 2′ positions. The reaction is then quenched with 1M TEAA and the sample is then reduced to ½ volume by rotovac before being desalted on a G25 size exclusion column. The oligo recovered is then analyzed spectrophotometrically for yield and for purity by capillary electrophoresis and by mass spectrometry.

[0215] [2′-O-(2-Methoxyethyl)]-[2′-deoxy]-[2′-O-(Methoxyethyl)] Chimeric Phosphorothioate Oligonucleotides

[0216] [2′-O-(2-methoxyethyl)]-[2′-deoxy]-[-2′-O-(methoxyethyl)] chimeric phosphorothioate oligonucleotides were prepared as per the procedure above for the 2′-O-methyl chimeric oligonucleotide, with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites.

[0217] [2′-O-(2-Methoxyethyl)Phosphodiester]-[2′-deoxy Phosphorothioate]-[2′-O-(2-Methoxyethyl) Phosphodiester] Chimeric Oligonucleotides

[0218] [2′-O-(2-methoxyethyl phosphodiester]-[2′-deoxy phosphorothioate]-[2′-O-(methoxyethyl) phosphodiester] chimeric oligonucleotides are prepared as per the above procedure for the 2′-O-methyl chimeric oligonucleotide with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites, oxidization with iodine to generate the phosphodiester internucleotide linkages within the wing portions of the chimeric structures and sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) to generate the phosphorothioate internucleotide linkages for the center gap.

[0219] Other chimeric oligonucleotides, chimeric oligonucleosides and mixed chimeric oligonucleotides/oligonucleosides are synthesized according to U.S. Pat. No. 5,623,065, herein incorporated by reference.

Example 6

[0220] Oligonucleotide Isolation

[0221] After cleavage from the controlled pore glass column (Applied Biosystems) and deblocking in concentrated ammonium hydroxide at 55° C. for 18 hours, the oligonucleotides or oligonucleosides are purified by precipitation twice out of 0.5 M NaCl with 2.5 volumes ethanol. Synthesized oligonucleotides were analyzed by polyacrylamide gel electrophoresis on denaturing gels and judged to be at least 85% full length material. The relative amounts of phosphorothioate and phosphodiester linkages obtained in synthesis were periodically checked by ³¹P nuclear magnetic resonance spectroscopy, and for some studies oligonucleotides were purified by HPLC, as described by Chiang et al., J. Biol. Chem. 1991, 266, 18162-18171. Results obtained with HPLC-purified material were similar to those obtained with non-HPLC purified material.

Example 7

[0222] Oligonucleotide Synthesis—96 Well Plate Format

[0223] Oligonucleotides were synthesized via solid phase P(III) phosphoramidite chemistry on an automated synthesizer capable of assembling 96 sequences simultaneously in a standard 96 well format. Phosphodiester internucleotide linkages were afforded by oxidation with aqueous iodine. Phosphorothioate internucleotide linkages were generated by sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) in anhydrous acetonitrile. Standard base-protected beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial vendors (e.g. PE-Applied Biosystems, Foster City, Calif., or Pharmacia, Piscataway, N.J.). Non-standard nucleosides are synthesized as per known literature or patented methods. They are utilized as base protected beta-cyanoethyldiisopropyl phosphoramidites.

[0224] Oligonucleotides were cleaved from support and deprotected with concentrated NH₄OH at elevated temperature (55-60° C.) for 12-16 hours and the released product then dried in vacuo. The dried product was then re-suspended in sterile water to afford a master plate from which all analytical and test plate samples are then diluted utilizing robotic pipettors.

Example 8

[0225] Oligonucleotide Analysis—96 Well Plate Format

[0226] The concentration of oligonucleotide in each well was assessed by dilution of samples and UV absorption spectroscopy. The full-length integrity of the individual products was evaluated by capillary electrophoresis (CE) in either the 96 well format (Beckman P/ACE™ MDQ) or, for individually prepared samples, on a commercial CE apparatus (e.g., Beckman P/ACE™ 5000, ABI 270). Base and backbone composition was confirmed by mass analysis of the compounds utilizing electrospray-mass spectroscopy. All assay test plates were diluted from the master plate using single and multi-channel robotic pipettors. Plates were judged to be acceptable if at least 85% of the compounds on the plate were at least 85% full length.

Example 9

[0227] Cell Culture and Oligonucleotide Treatment

[0228] The effect of antisense compounds on target nucleic acid expression can be tested in any of a variety of cell types provided that the target nucleic acid is present at measurable levels. This can be routinely determined using, for example, PCR or Northern blot analysis. The following 4 cell types are provided for illustrative purposes, but other cell types can be routinely used, provided that the target is expressed in the cell type chosen. This can be readily determined by methods routine in the art, for example Northern blot analysis, Ribonuclease protection assays, or RT-PCR.

[0229] T-24 Cells:

[0230] The human transitional cell bladder carcinoma cell line T-24 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). T-24 cells were routinely cultured in complete McCoy's 5A basal media (Invitrogen Corporation, Carlsbad, Calif.) supplemented with 10% fetal calf serum ((Invitrogen Corporation, Carlsbad, Calif.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Invitrogen Corporation, Carlsbad, Calif.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence. Cells were seeded into 96-well plates (Falcon-Primaria #3872) at a density of 7000 cells/well for use in RT-PCR analysis.

[0231] For Northern blotting or other analysis, cells may be seeded onto 100 mm or other standard tissue culture plates and treated similarly, using appropriate volumes of medium and oligonucleotide.

[0232] A549 Cells:

[0233] The human lung carcinoma cell line A549 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). A549 cells were routinely cultured in DMEM basal media (Invitrogen Corporation, Carlsbad, Calif.) supplemented with 10% fetal calf serum (Invitrogen Corporation, Carlsbad, Calif.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Invitrogen Corporation, Carlsbad, Calif.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence.

[0234] NHDF Cells:

[0235] Human neonatal dermal fibroblast (NHDF) were obtained from the Clonetics Corporation (Walkersville, Md.). NHDFs were routinely maintained in Fibroblast Growth Medium (Clonetics Corporation, Walkersville, Md.) supplemented as recommended by the supplier. Cells were maintained for up to 10 passages as recommended by the supplier.

[0236] HEK Cells:

[0237] Human embryonic keratinocytes (HEK) were obtained from the Clonetics Corporation (Walkersville, Md.). HEKs were routinely maintained in Keratinocyte Growth Medium (Clonetics Corporation, Walkersville, Md.) formulated as recommended by the supplier. Cells were routinely maintained for up to 10 passages as recommended by the supplier.

[0238] Treatment with Antisense Compounds:

[0239] When cells reached 70% confluency, they were treated with oligonucleotide. For cells grown in 96-well plates, wells were washed once with 100 μL OPTI-MEM™-1 reduced-serum medium (Invitrogen Corporation, Carlsbad, Calif.) and then treated with 130 μL of OPTI-MEM™-1 containing 3.75 μg/mL LIPOFECTIN™ (Invitrogen Corporation, Carlsbad, Calif.) and the desired concentration of oligonucleotide. After 4-7 hours of treatment, the medium was replaced with fresh medium. Cells were harvested 16-24 hours after oligonucleotide treatment.

[0240] The concentration of oligonucleotide used varies from cell line to cell line. To determine the optimal oligonucleotide concentration for a particular cell line, the cells are treated with a positive control oligonucleotide at a range of concentrations. For human cells the positive control oligonucleotide is ISIS 13920, TCCGTCATCGCTCCTCAGGG, SEQ ID NO: 1, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to human H-ras. For mouse or rat cells the positive control oligonucleotide is ISIS 15770, ATGCATTCTGCCCCCAAGGA, SEQ ID NO: 2, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to both mouse and rat c-raf. The concentration of positive control oligonucleotide that results in 80% inhibition of c-Ha-ras (for ISIS 13920) or c-raf (for ISIS 15770) mRNA is then utilized as the screening concentration for new oligonucleotides in subsequent experiments for that cell line. If 80% inhibition is not achieved, the lowest concentration of positive control oligonucleotide that results in 60% inhibition of H-ras or c-raf mRNA is then utilized as the oligonucleotide screening concentration in subsequent experiments for that cell line. If 60% inhibition is not achieved, that particular cell line is deemed as unsuitable for oligonucleotide transfection experiments.

Example 10

[0241] Analysis of Oligonucleotide Inhibition of NOD1 Expression

[0242] Antisense modulation of NOD1 expression can be assayed in a variety of ways known in the art. For example, NOD1 mRNA levels can be quantitated by, e.g., Northern blot analysis, competitive polymerase chain reaction (PCR), or real-time PCR (RT-PCR). Real-time quantitative PCR is presently preferred. RNA analysis can be performed on total cellular RNA or poly(A)+ mRNA. The preferred method of RNA analysis of the present invention is the use of total cellular RNA as described in other examples herein. Methods of RNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.1.1-4.2.9 and 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Northern blot analysis is routine in the art and is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.2.1-4.2.9, John Wiley & Sons, Inc., 1996. Real-time quantitative (PCR) can be conveniently accomplished using the commercially available ABI PRISM™ 7700 Sequence Detection System, available from PE-Applied Biosystems, Foster City, Calif. and used according to manufacturer's instructions.

[0243] Protein levels of NOD1 can be quantitated in a variety of ways well known in the art, such as immunoprecipitation, Western blot analysis (immunoblotting), ELISA or fluorescence-activated cell sorting (FACS). Antibodies directed to NOD1 can be identified and obtained from a variety of sources, such as the MSRS catalog of antibodies (Aerie Corporation, Birmingham, Mich.), or can be prepared via conventional antibody generation methods. Methods for preparation of polyclonal antisera are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.12.1-11.12.9, John Wiley & Sons, Inc., 1997. Preparation of monoclonal antibodies is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.4.1-11.11.5, John Wiley & Sons, Inc., 1997.

[0244] Immunoprecipitation methods are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.16.1-10.16.11, John Wiley & Sons, Inc., 1998. Western blot (immunoblot) analysis is standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.8.1-10.8.21, John Wiley & Sons, Inc., 1997. Enzyme-linked immunosorbent assays (ELISA) are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.2.1-11.2.22, John Wiley & Sons, Inc., 1991.

Example 11

[0245] Poly(A)+ mRNA Isolation

[0246] Poly(A)+ mRNA was isolated according to Miura et al., Clin. Chem., 1996, 42, 1758-1764. Other methods for poly(A)+ mRNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 60 μL lysis buffer (10 mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl, 0.5% NP-40, 20 mM vanadyl-ribonucleoside complex) was added to each well, the plate was gently agitated and then incubated at room temperature for five minutes. 55 μL of lysate was transferred to Oligo d(T) coated 96-well plates (AGCT Inc., Irvine Calif.). Plates were incubated for 60 minutes at room temperature, washed 3 times with 200 μL of wash buffer (10 mM Tris-HCl pH 7.6, 1 mM EDTA, 0.3 M NaCl). After the final wash, the plate was blotted on paper towels to remove excess wash buffer and then air-dried for 5 minutes. 60 μL of elution buffer (5 mM Tris-HCl pH 7.6), preheated to 70° C. was added to each well, the plate was incubated on a 90° C. hot plate for 5 minutes, and the eluate was then transferred to a fresh 96-well plate.

[0247] Cells grown on 100 mm or other standard plates may be treated similarly, using appropriate volumes of all solutions.

Example 12

[0248] Total RNA Isolation

[0249] Total RNA was isolated using an RNEASY96™ kit and buffers purchased from Qiagen Inc. (Valencia, Calif.) following the manufacturer's recommended procedures. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 150 μL Buffer RLT was added to each well and the plate vigorously agitated for 20 seconds. 150 μL of 70% ethanol was then added to each well and the contents mixed by pipetting three times up and down. The samples were then transferred to the RNEASY₉₆™ well plate attached to a QIAVAC™ manifold fitted with a waste collection tray and attached to a vacuum source. Vacuum was applied for 1 minute. 500 μL of Buffer RW1 was added to each well of the RNEASY 96™ plate and incubated for 15 minutes and the vacuum was again applied for 1 minute. An additional 500 μL of Buffer RW1 was added to each well of the RNEASY 96™ plate and the vacuum was applied for 2 minutes. 1 mL of Buffer RPE was then added to each well of the RNEASY 96™ plate and the vacuum applied for a period of 90 seconds. The Buffer RPE wash was then repeated and the vacuum was applied for an additional 3 minutes. The plate was then removed from the QIAVAC™ manifold and blotted dry on paper towels. The plate was then re-attached to the QIAVAC™ manifold fitted with a collection tube rack containing 1.2 mL collection tubes. RNA was then eluted by pipetting 170 μL water into each well, incubating 1 minute, and then applying the vacuum for 3 minutes.

[0250] The repetitive pipetting and elution steps may be automated using a QIAGEN Bio-Robot 9604 (Qiagen, Inc., Valencia Calif.). Essentially, after lysing of the cells on the culture plate, the plate is transferred to the robot deck where the pipetting, DNase treatment and elution steps are carried out.

Example 13

[0251] Real-Time Quantitative PCR Analysis of NOD1 mRNA Levels

[0252] Quantitation of NOD1 mRNA levels was determined by real-time quantitative PCR using the ABI PRISM™ 7700 Sequence Detection System (PE-Applied Biosystems, Foster City, Calif.) according to manufacturer's instructions. This is a closed-tube, non-gel-based, fluorescence detection system which allows high-throughput quantitation of polymerase chain reaction (PCR) products in real-time. As opposed to standard PCR, in which amplification products are quantitated after the PCR is completed, products in real-time quantitative PCR are quantitated as they accumulate. This is accomplished by including in the PCR reaction an oligonucleotide probe that anneals specifically between the forward and reverse PCR primers, and contains two fluorescent dyes. A reporter dye (e.g., FAM, obtained from either Operon Technologies Inc., Alameda, Calif. or Integrated DNA Technologies Inc., Coralville, Iowa) is attached to the 5′ end of the probe and a quencher dye (e.g., TAMRA, obtained from either Operon Technologies Inc., Alameda, Calif. or Integrated DNA Technologies Inc., Coralville, Iowa) is attached to the 3′ end of the probe. When the probe and dyes are intact, reporter dye emission is quenched by the proximity of the 3′ quencher dye. During amplification, annealing of the probe to the target sequence creates a substrate that can be cleaved by the 5′-exonuclease activity of Taq polymerase. During the extension phase of the PCR amplification cycle, cleavage of the probe by Taq polymerase releases the reporter dye from the remainder of the probe (and hence from the quencher moiety) and a sequence-specific fluorescent signal is generated. With each cycle, additional reporter dye molecules are cleaved from their respective probes, and the fluorescence intensity is monitored at regular intervals by laser optics built into the ABI PRISM™ 7700 Sequence Detection System. In each assay, a series of parallel reactions containing serial dilutions of mRNA from untreated control samples generates a standard curve that is used to quantitate the percent inhibition after antisense oligonucleotide treatment of test samples.

[0253] Prior to quantitative PCR analysis, primer-probe sets specific to the target gene being measured are evaluated for their ability to be “multiplexed” with a GAPDH amplification reaction. In multiplexing, both the target gene and the internal standard gene GAPDH are amplified concurrently in a single sample. In this analysis, mRNA isolated from untreated cells is serially diluted. Each dilution is amplified in the presence of primer-probe sets specific for GAPDH only, target gene only (“single-plexing”), or both (multiplexing). Following PCR amplification, standard curves of GAPDH and target mRNA signal as a function of dilution are generated from both the single-plexed and multiplexed samples. If both the slope and correlation coefficient of the GAPDH and target signals generated from the multiplexed samples fall within 10% of their corresponding values generated from the single-plexed samples, the primer-probe set specific for that target is deemed multiplexable. Other methods of PCR are also known in the art.

[0254] PCR reagents were obtained from Invitrogen, Carlsbad, Calif. RT-PCR reactions were carried out by adding 20 μL PCR cocktail (2.5×PCR buffer (—MgCl2), 6.6 mM MgCl2, 375 μM each of dATP, dCTP, dCTP and dGTP, 375 nM each of forward primer and reverse primer, 125 nM of probe, 4 Units RNAse inhibitor, 1.25 Units PLATINUM® Taq, 5 Units MuLV reverse transcriptase, and 2.5×ROX dye) to 96 well plates containing 30 μL total RNA solution. The RT reaction was carried out by incubation for 30 minutes at 48° C. Following a 10 minute incubation at 95° C. to activate the PLATINUM® Taq, 40 cycles of a two-step PCR protocol were carried out: 95° C. for 15 seconds (denaturation) followed by 60° C. for 1.5 minutes (annealing/extension).

[0255] Gene target quantities obtained by real time RT-PCR are normalized using either the expression level of GAPDH, a gene whose expression is constant, or by quantifying total RNA using RiboGreen™ (Molecular Probes, Inc. Eugene, Oreg.). GAPDH expression is quantified by real time RT-PCR, by being run simultaneously with the target, multiplexing, or separately. Total RNA is quantified using RiboGreen™ RNA quantification reagent from Molecular Probes. Methods of RNA quantification by RiboGreen™ are taught in Jones, L. J., et al, Analytical Biochemistry, 1998, 265, 368-374.

[0256] In this assay, 170 μL of RiboGreen™ working reagent (RiboGreen™ reagent diluted 1:350 in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) is pipetted into a 96-well plate containing 30 μL purified, cellular RNA. The plate is read in a CytoFluor 4000 (PE Applied Biosystems) with excitation at 480 nm and emission at 520 nm.

[0257] Probes and primers to human NOD1 were designed to hybridize to a human NOD1 sequence, using published sequence information (GenBank accession number NM_(—)006092.1, incorporated herein as SEQ ID NO:3). For human NOD1 the PCR primers were:

[0258] forward primer: GCAGGCGGGACTATCAGGA (SEQ ID NO: 4)

[0259] reverse primer: AGTTTGCCGACCAGACCTTCT (SEQ ID NO: 5) and the PCR probe was: FAM-TCCACTGCCTCCATGATGCAAGCC-TAMRA (SEQ ID NO: 6) where FAM (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMRA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye. For human GAPDH the PCR primers were:

[0260] forward primer: GAAGGTGAAGGTCGGAGTC(SEQ ID NO:7)

[0261] reverse primer: GAAGATGGTGATGGGATTTC (SEQ ID NO:8) and the PCR probe was: 5′ JOE-CAAGCTTCCCGTTCTCAGCC-TAMRA 3′ (SEQ ID NO: 9) where JOE (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMRA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye.

Example 14

[0262] Northern Blot Analysis of NOD1 mRNA Levels

[0263] Eighteen hours after antisense treatment, cell monolayers were washed twice with cold PBS and lysed in 1 mL RNAZOL™ (TEL-TEST “B” Inc., Friendswood, Tex.). Total RNA was prepared following manufacturer's recommended protocols. Twenty micrograms of total RNA was fractionated by electrophoresis through 1.2% agarose gels containing 1.1% formaldehyde using a MOPS buffer system (AMRESCO, Inc. Solon, Ohio). RNA was transferred from the gel to HYBONDTM-N+nylon membranes (Amersham Pharmacia Biotech, Piscataway, N.J.) by overnight capillary transfer using a Northern/Southern Transfer buffer system (TEL-TEST “B” Inc., Friendswood, Tex.). RNA transfer was confirmed by UV visualization. Membranes were fixed by UV cross-linking using a STRATALINKER™ UV Crosslinker 2400 (Stratagene, Inc, La Jolla, Calif.) and then probed using QUICKHYB™ hybridization solution (Stratagene, La Jolla, Calif.) using manufacturer's recommendations' for stringent conditions.

[0264] To detect human NOD1, a human NOD1 specific probe was prepared by PCR using the forward primer GCAGGCGGGACTATCAGGA (SEQ ID NO: 4) and the reverse primer AGTTTGCCGACCAGACCTTCT (SEQ ID NO: 5). To normalize for variations in loading and transfer efficiency membranes were stripped and probed for human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) RNA (Clontech, Palo Alto, Calif.).

[0265] Hybridized membranes were visualized and quantitated using a PHOSPHORIMAGER™ and IMAGEQUANT™ Software V3.3 (Molecular Dynamics, Sunnyvale, Calif.). Data was normalized to GAPDH levels in untreated controls.

Example 15

[0266] Antisense Inhibition of Human NOD1 Expression by Chimeric Phosphorothioate Oligonucleotides Having 2′-MOE Wings and a Deoxy Gap

[0267] In accordance with the present invention, a series of oligonucleotides were designed to target different regions of the human NOD1 RNA, using published sequences (GenBank accession number NM_(—)006092.1, incorporated herein as SEQ ID NO: 3; residues 17001-38580 from GenBank accession number AF149773.1, representing a partial genomic sequence of NOD1, incorporated herein as SEQ ID NO: 10; GenBank accession number AK023969.1, representing the NOD1 variant CARD4-X, incorporated herein as SEQ ID NO: 11; GenBank accession number BE928388.1, representing a partial coding sequence and 3′UTR of NOD1, the complement of which is incorporated herein as SEQ ID NO: 12; GenBank accession number BG985065.1, representing a partial sequence of NOD1 which includes an exon:exon junction, incorporated herein as SEQ ID NO: 14, and residues 521000-3555000 of GenBank accession number NT_(—)007897.4, representing a partial genomic sequence of NOD1, the complement of which is incorporated herein as SEQ ID NO: 15). The oligonucleotides are shown in Table 1. “Target site” indicates the first (5′-most) nucleotide number on the particular target sequence to which the oligonucleotide binds. All compounds in Table 1 are chimeric oligonucleotides (“gapmers”) 20 nucleotides in length, composed of a central “gap” region consisting of ten 2′-deoxynucleotides, which is flanked on both sides (5′ and 3′ directions) by five-nucleotide “wings”. The wings are composed of 2′-methoxyethyl (2′-MOE)nucleotides. The internucleoside (backbone) linkages are phosphorothioate (P=S) throughout the oligonucleotide. All cytidine residues are 5-methylcytidines. The compounds were analyzed for their effect on human NOD1 mRNA levels by quantitative real-time PCR as described in other examples herein. Data are averages from two experiments. If present, “N.D.” indicates “no data”. TABLE 1 Inhibition of human NOD1 mRNA levels by chimeric phosphorothioate oligonucleotides having 2′-MOE wings and a deoxy gap TAR- GET SEQ TAR- % SEQ RE- ID GET IN- ID ISIS # GION NO SITE SEQUENCE HIB NO 199141 5′UTR 3  41 cctgacttacaatcacttgg 23 16 199142 5′UTR 3  156 agcaacttgtcttcccagac 81 17 199143 5′UTR 3  198 aattgcttctgtctcttcca 29 18 199144 5′UTR 3  236 aacattgtttaaatcttcaa 37 19 199145 5′UTR 3  354 gtcctctcagcagaagggca 76 20 199146 Start 3  417 ctgctcttccatagttaaag 17 21 Codon 199147 Coding 3  449 tctgatgggattatttccat 79 22 199148 Coding 3  496 ccagaagttcccgattgctt 90 23 199149 Coding 3  616 ttttgcggaccttgtcaggc 8 24 199150 Coding 3  800 tgctgggtatacctgctcac 7 25 199151 Coding 3  983 tcattgaggatgccggtggt 66 26 199152 Coding 3 1013 tcacccaggatgaagatggt 69 27 199153 Coding 3 1272 cagctcgtccaggccatcga 55 28 199154 Coding 3 1291 tcaggtccaagtccgagtgc 58 29 199155 Coding 3 1384 tagcccccttgagcagcttc 70 30 199156 Coding 3 1416 ctcgatgcctgtgcgggctg 75 31 199157 Coding 3 1541 tccagctggctcagcaggcg 0 32 199158 Coding 3 1619 cggaagtgctggaagcaccg 6 33 199159 Coding 3 1713 catcctgttcagatggacct 61 34 199160 Coding 3 1731 caccaggctgctgggctgca 61 35 199161 Coding 3 1787 cacagagtgtcccggccggc 71 36 199162 Coding 3 1798 gccccagcgagcacagagtg 55 37 199163 Coding 3 1858 cctgcacctcctcctgggtg 63 38 199164 Coding 3 2005 tgtcgtccagcacgaggaag 16 39 199165 Coding 3 2072 gtggtcgctgcccccgcagg 0 40 199166 Coding 3 2079 gcaggacgtggtcgctgccc 64 41 199167 Coding 3 2162 tggaagtgatccttgttctt 72 42 199168 Coding 3 2220 ccgcaggagtttctgtttgg 42 43 199169 Coding 3 2261 gccttgcgctttctcctcag 34 44 199170 Coding 3 2293 cccgcaggctggaaaacagg 66 45 199171 Coding 3 2462 ttgcagtaggtcagcttgag 72 46 199172 Coding 3 2580 gggctgcagctcccgcacgc 16 47 199173 Coding 3 2601 aacagtgaggcggctgaagc 0 48 199174 Coding 3 2707 tctggttgttgtataaaccc 0 49 199175 Coding 3 2841 tttgctgttcttcacagcca 55 50 199176 Coding 3 2920 ggttccgcagagcctctgcg 65 51 199177 Coding 3 2952 ggacgcaagactcagggtgg 12 52 199178 Coding 3 2992 ccctcgcaaggctctttcct 60 53 199179 Coding 3 3120 ctggataagccataaatgct 54 54 199180 Coding 3 3193 aaatctctgttatgccagtg 61 55 199181 Coding 3 3204 tccatttaggcaaatctctg 26 56 199182 Coding 3 3225 ctcctctggttttatcaggt 76 57 199183 Stop 3 3276 agcatcctctcagaaacaga 72 58 Codon 199184 3′UTR 3 3302 cagggcaaaaaccccatgaa 66 59 199185 3′UTR 3 3370 gcctgcgcaggcccctttaa 34 60 199186 3′UTR 3 3524 gtctcttcacagtgtattta 64 61 199187 3′UTR 3 3592 gcctcctctgtttgctcaca 67 62 199188 3′UTR 3 3606 atgaggtgaggctggcctcc 64 63 199189 3′UTR 3 3689 caactttgtgccacatcctc 29 64 199190 3′UTR 3 3788 gggaccagtatcaatacatg 62 65 199191 3′UTR 3 3878 tgcatcctttccttgacatt 63 66 199192 3′UTR 3 3923 gccagcagacagtgagactc 33 67 199193 3′UTR 3 3948 agagctgagggcattctcta 51 68 199194 3′UTR 3 3989 gaagcagcattttgaaggca 34 69 199195 3′UTR 3 4039 tactcagccttctagaggag 6 70 199196 3′UTR 3 4195 cagctgcttgggaggcagga 50 71 199197 3′UTR 3 4250 gctggcccagttcccagccg 61 72 199198 3′UTR 3 4356 ttaacaaatgagcgggcaag 0 73 199199 3′UTR 3 4367 tcagtattttattaacaaat 34 74 199200 Exon: 10 15368  agatacacactcactcagtg 20 75 Intron Junction 199201 Exon: 11 2760 tttccaaagtcccaaatagg 11 76 Exon Junction 199202 Exon: 11 2894 ttttcccagtctggctccga 12 77 Exon Junction 199203 3′UTR 12  231 tcccagctcttcactcagac 25 78 199204 Intron 15 24591  ggctccaagcccagcctgtc 54 79 11 199205 Intron 15 24651  tcagcccaaggctctccagc 0 80 11 199206 Intron 15 24705  actcaagagagcctacttgg 38 81 11 199207 Exon: 14  45 ttacaatcactcagtgtcac 61 82 Exon Junction 199208 Exon 2 10 18943  aagtctcctgacttacaatc 64 83 199209 Intron: 15 9537 tacgctgagtctgaaataaa 20 84 Exon Junction 199210 Intron: 15 10770  tttccaaagtctgggttgaa 78 85 Exon Junction 199211 Exon 8 15 10914  ccaggattttggtgacgtac 61 86 199212 Intron: 15 21837  ggacgcaagactaggaagga 49 87 Exon Junction 199213 Exon: 15 21922  cgagctattaccacagtatt 79 88 Intron Junction 199214 Intron 15 22517  ctgaaggtatccaaggatac 0 89 11 199215 Exon 15 29415  actttcataccatgcacatt 54 90 Intron Junction 199216 Intron 15 29478  ccaacacttatgcaaacaca 58 91 14 199217 Intron 15 31291  caaactgttcactgactttc 0 92 14 199218 Intron: 15 32208  tccatttaggctgttgaaaa 32 93 Exon Junction

[0268] As shown in Table 1, SEQ ID NOs 17, 20, 22, 23, 26, 27, 28, 29, 30, 31, 34, 35, 36, 37, 38, 41, 42, 45, 46, 50, 51, 53, 54, 55, 57, 58, 59, 61, 62, 63, 65, 66, 68, 71, 72, 79, 82, 83, 85, 86, 87, 88, 90 and 91 demonstrated at least 48% inhibition of human NOD1 expression in this assay and are therefore preferred. The target sites to which these preferred sequences are complementary are herein referred to as “active sites” and are therefore preferred sites for targeting by compounds of the present invention.

Example 16

[0269] Western Blot Analysis of NOD1 Protein Levels

[0270] Western blot analysis (immunoblot analysis) is carried out using standard methods. Cells are harvested 16-20 h after oligonucleotide treatment, washed once with PBS, suspended in Laemmli buffer (100 ul/well), boiled for 5 minutes and loaded on a 16% SDS-PAGE gel. Gels are run for 1.5 hours at 150 V, and transferred to membrane for western blotting. Appropriate primary antibody directed to NOD1 is used, with a radiolabelled or fluorescently labeled secondary antibody directed against the primary antibody species. Bands are visualized using a PHOSPHORIMAGER™ (Molecular Dynamics, Sunnyvale Calif.).

Example 17

[0271] It is advantageous to selectively inhibit the expression of one or more variants of NOD1. Consequently, in one embodiment of the present invention are oligonucleotides that selectively target, hybridize to, and specifically inhibit one or more, but fewer than all of the variants of NOD1. A summary of the target sites of the variants is shown in Table 2 and includes NOD1 mRNA (CARD4-L), incorporated herein as SEQ ID NO: 3, GenBank accession number AX082223.1, representing the NOD1 variant CARD4-S, incorporated herein as SEQ ID NO: 94, the NOD1 variant CARD4-X, incorporated herein as SEQ ID NO: 10, GenBank Accession number AX082236.1, representing the NOD1 variant CARD4-Y, incorporated herein as SEQ ID NO: 95 and GenBank accession number AX082238.1, representing the NOD1 variant CARD4-Z, incorporated herein as SEQ ID NO: 96. TABLE 2 Targeting of individual oligonucleotides to specific variants of NOD1 OLIGO SEQ VARIANT ISIS # ID NO. TARGET SITE VARIANT SEQ ID NO. 199141 16 41 CARD4-L 3 199141 16 130 CARD4-X 11 199141 16 83 CARD4-Y 95 199141 16 134 CARD4-Z 96 199142 17 156 CARD4-L 3 199143 18 198 CARD4-L 3 199143 18 258 CARD4-X 11 199143 18 211 CARD4-Y 95 199143 18 262 CARD4-Z 96 199144 19 236 CARD4-L 3 199144 19 296 CARD4-X 11 199144 19 249 CARD4-Y 95 199144 19 300 CARD4-Z 96 199145 20 354 CARD4-L 3 199145 20 414 CARD4-X 11 199145 20 367 CARD4-Y 95 199145 20 418 CARD4-Z 96 199146 21 417 CARD4-L 3 199146 21 477 CARD4-X 11 199146 21 430 CARD4-Y 95 199146 21 481 CARD4-Z 96 199147 22 449 CARD4-L 3 199147 22 509 CARD4-X 11 199147 22 462 CARD4-Y 95 199147 22 513 CARD4-Z 96 199148 23 496 CARD4-L 3 199148 23 556 CARD4-X 11 199148 23 509 CARD4-Y 95 199148 23 560 CARD4-Z 96 199150 25 800 CARD4-L 3 199150 25 860 CARD4-X 11 199150 25 259 CARD4-S 94 199150 25 813 CARD4-Y 95 199151 26 983 CARD4-L 3 199151 26 1043 CARD4-X 11 199151 26 442 CARD4-S 94 199151 26 996 CARD4-Y 95 199152 27 1013 CARD4-L 3 199152 27 1073 CARD4-X 11 199152 27 472 CARD4-S 94 199153 28 1272 CARD4-L 3 199153 28 731 CARD4-S 94 199153 28 1149 CARD4-Y 95 199154 29 1291 CARD4-L 3 199154 29 1351 CARD4-X 11 199154 29 750 CARD4-S 94 199154 29 1168 CARD4-Y 95 199155 30 1384 CARD4-L 3 199155 30 1444 CARD4-X 11 199155 30 843 CARD4-S 94 199155 30 1261 CARD4-Y 95 199156 31 1416 CARD4-L 3 199156 31 1476 CARD4-X 11 199156 31 875 CARD4-S 94 199156 31 1293 CARD4-Y 95 199157 32 1541 CARD4-L 3 199157 32 1601 CARD4-X 11 199157 32 1000 CARD4-S 94 199157 32 1418 CARD4-Y 95 199158 33 1619 CARD4-L 3 199158 33 1679 CARD4-X 11 199158 33 1078 CARD4-S 94 199158 33 1496 CARD4-Y 95 199159 34 1713 CARD4-L 3 199159 34 1773 CARD4-X 11 199159 34 1172 CARD4-S 94 199159 34 1590 CARD4-Y 95 199160 35 1731 CARD4-L 3 199160 35 1791 CARD4-X 11 199160 35 1190 CARD4-S 94 199160 35 1608 CARD4-Y 95 199161 36 1787 CARD4-L 3 199161 36 1847 CARD4-X 11 199161 36 1246 CARD4-S 94 199161 36 1664 CARD4-Y 95 199162 37 1798 CARD4-L 3 199162 37 1858 CARD4-X 11 199162 37 1257 CARD4-S 94 199162 37 1675 CARD4-Y 95 199163 38 1858 CARD4-L 3 199163 38 1918 CARD4-X 11 199163 38 1317 CARD4-S 94 199163 38 1735 CARD4-Y 95 199164 39 2005 CARD4-L 3 199164 39 2065 CARD4-X 11 199164 39 1882 CARD4-Y 95 199165 40 2072 CARD4-L 3 199165 40 2132 CARD4-X 11 199165 40 1949 CARD4-Y 95 199166 41 2079 CARD4-L 3 199166 41 2139 CARD4-X 11 199166 41 1956 CARD4-Y 95 199167 42 2162 CARD4-L 3 199167 42 2222 CARD4-X 11 199167 42 2039 CARD4-Y 95 199168 43 2220 CARD4-L 3 199168 43 2280 CARD4-X 11 199168 43 2097 CARD4-Y 95 199169 44 2261 CARD4-L 3 199169 44 2321 CARD4-X 11 199169 44 2138 CARD4-Y 95 199170 45 2293 CARD4-L 3 199170 45 2353 CARD4-X 11 199170 45 2170 CARD4-Y 95 199171 46 2462 CARD4-L 3 199171 46 2522 CARD4-X 11 199171 46 2339 CARD4-Y 95 199172 47 2580 CARD4-L 3 199172 47 2640 CARD4-X 11 199172 47 2457 CARD4-Y 95 199173 48 2601 CARD4-L 3 199173 48 2661 CARD4-X 11 199173 48 2478 CARD4-Y 95 199174 49 2707 CARD4-L 3 199174 49 2584 CARD4-Y 95 199175 50 2841 CARD4-L 3 199175 50 2951 CARD4-X 11 199175 50 1655 CARD4-S 94 199175 50 2718 CARD4-Y 95 199176 51 2920 CARD4-L 3 199176 51 1734 CARD4-S 94 199176 51 2797 CARD4-Y 95 199177 52 2952 CARD4-L 3 199177 52 3062 CARD4-X 11 199177 52 1766 CARD4-S 94 199177 52 2829 CARD4-Y 95 199178 53 2992 CARD4-L 3 199178 53 3102 CARD4-X 11 199178 53 1806 CARD4-S 94 199178 53 2869 CARD4-Y 95 199179 54 3120 CARD4-L 3 199179 54 3230 CARD4-X 11 199179 54 1934 CARD4-S 94 199179 54 2997 CARD4-Y 95 199180 55 3193 CARD4-L 3 199180 55 3303 CARD4-X 11 199180 55 3070 CARD4-Y 95 199181 56 3204 CARD4-L 3 199181 56 3081 CARD4-Y 95 199182 57 3225 CARD4-L 3 199182 57 3441 CARD4-X 11 199182 57 3102 CARD4-Y 95 199183 58 3276 CARD4-L 3 199183 58 3492 CARD4-X 11 199183 58 3153 CARD4-Y 95 199184 59 3302 CARD4-L 3 199184 59 3518 CARD4-X 11 199184 59 3179 CARD4-Y 95 199185 60 3370 CARD4-L 3 199185 60 3586 CARD4-X 11 199185 60 1984 CARD4-S 94 199185 60 3247 CARD4-Y 95 199186 61 3524 CARD4-L 3 199186 61 3740 CARD4-X 11 199186 61 2138 CARD4-S 94 199186 61 3401 CARD4-Y 95 199187 62 3592 CARD4-L 3 199187 62 3808 CARD4-X 11 199187 62 2206 CARD4-S 94 199187 62 3469 CARD4-Y 95 199188 63 3606 CARD4-L 3 199188 63 3822 CARD4-X 11 199188 63 2220 CARD4-S 94 199188 63 3483 CARD4-Y 95 199189 64 3689 CARD4-L 3 199189 64 3905 CARD4-X 11 199189 64 2303 CARD4-S 94 199189 64 3566 CARD4-Y 95 199190 65 3788 CARD4-L 3 199190 65 4004 CARD4-X 11 199190 65 2402 CARD4-S 94 199190 65 3665 CARD4-Y 95 199191 66 3878 CARD4-L 3 199191 66 4094 CARD4-X 11 199191 66 2492 CARD4-S 94 199191 66 3755 CARD4-Y 95 199192 67 3923 CARD4-L 3 199192 67 4139 CARD4-X 11 199192 67 2537 CARD4-S 94 199192 67 3800 CARD4-Y 95 199193 68 3948 CARD4-L 3 199193 68 4164 CARD4-X 11 199193 68 2562 CARD4-S 94 199193 68 3825 CARD4-Y 95 199194 69 3989 CARD4-L 3 199194 69 4205 CARD4-X 11 199194 69 2603 CARD4-S 94 199194 69 3866 CARD4-Y 95 199195 70 4039 CARD4-L 3 199195 70 4255 CARD4-X 11 199195 70 2653 CARD4-S 94 199195 70 3916 CARD4-Y 95 199196 71 4195 CARD4-L 3 199196 71 4411 CARD4-X 11 199196 71 2809 CARD4-S 94 199196 71 4072 CARD4-Y 95 199197 72 4250 CARD4-L 3 199197 72 4466 CARD4-X 11 199197 72 2864 CARD4-S 94 199197 72 4127 CARD4-Y 95 199198 73 4356 CARD4-L 3 199199 74 4367 CARD4-L 3 199199 74 4583 CARD4-X 11 199199 74 2981 CARD4-S 94 199199 74 4244 CARD4-Y 95 199201 76 2760 CARD4-X 11 199202 77 2894 CARD4-X 11 199202 77 1598 CARD4-S 94 199208 83 47 CARD4-L 3 199208 83 136 CARD4-X 11 199208 83 89 CARD4-Y 95 199208 83 140 CARD4-Z 96 199211 86 2746 CARD4-L 3 199211 86 2623 CARD4-Y 95

[0272]

1 96 1 20 DNA Artificial Sequence Antisense Oligonucleotide 1 tccgtcatcg ctcctcaggg 20 2 20 DNA Artificial Sequence Antisense Oligonucleotide 2 atgcattctg cccccaagga 20 3 4390 DNA Homo sapiens CDS (425)...(3286) 3 ctctagctct cagcggctgc gaagtctgta aacctggtgg ccaagtgatt gtaagtcagg 60 agactttcct tcggtttctg cctttgatgg caatttcctt cggtttctgc ctttgatggc 120 aagaggtgga gattgtggcg gcgattacag agaacgtctg ggaagacaag ttgctgtttt 180 tatgggaatc gcaggcttgg aagagacaga agcaattcca gaaataaatt ggaaattgaa 240 gatttaaaca atgttgtttt aaaatattct aacttcaaag aatgatgcca gaaacttaaa 300 aaggggctgc gcagagtagc aggggccctg gagggcgcgg cctgaatcct gattgccctt 360 ctgctgagag gacacacgca gctgaagatg aatttgggaa aagtagccgc ttgctacttt 420 aact atg gaa gag cag ggc cac agt gag atg gaa ata atc cca tca gag 469 Met Glu Glu Gln Gly His Ser Glu Met Glu Ile Ile Pro Ser Glu 1 5 10 15 tct cac ccc cac att caa tta ctg aaa agc aat cgg gaa ctt ctg gtc 517 Ser His Pro His Ile Gln Leu Leu Lys Ser Asn Arg Glu Leu Leu Val 20 25 30 act cac atc cgc aat act cag tgt ctg gtg gac aac ttg ctg aag aat 565 Thr His Ile Arg Asn Thr Gln Cys Leu Val Asp Asn Leu Leu Lys Asn 35 40 45 gac tac ttc tcg gcc gaa gat gcg gag att gtg tgt gcc tgc ccc acc 613 Asp Tyr Phe Ser Ala Glu Asp Ala Glu Ile Val Cys Ala Cys Pro Thr 50 55 60 cag cct gac aag gtc cgc aaa att ctg gac ctg gta cag agc aag ggc 661 Gln Pro Asp Lys Val Arg Lys Ile Leu Asp Leu Val Gln Ser Lys Gly 65 70 75 gag gag gtg tcc gag ttc ttc ctc tac ttg ctc cag caa ctc gca gat 709 Glu Glu Val Ser Glu Phe Phe Leu Tyr Leu Leu Gln Gln Leu Ala Asp 80 85 90 95 gcc tac gtg gac ctc agg cct tgg ctg ctg gag atc ggc ttc tcc cct 757 Ala Tyr Val Asp Leu Arg Pro Trp Leu Leu Glu Ile Gly Phe Ser Pro 100 105 110 tcc ctg ctc act cag agc aaa gtc gtg gtc aac act gac cca gtg agc 805 Ser Leu Leu Thr Gln Ser Lys Val Val Val Asn Thr Asp Pro Val Ser 115 120 125 agg tat acc cag cag ctg cga cac cat ctg ggc cgt gac tcc aag ttc 853 Arg Tyr Thr Gln Gln Leu Arg His His Leu Gly Arg Asp Ser Lys Phe 130 135 140 gtg ctg tgc tat gcc cag aag gag gag ctg ctg ctg gag gag atc tac 901 Val Leu Cys Tyr Ala Gln Lys Glu Glu Leu Leu Leu Glu Glu Ile Tyr 145 150 155 atg gac acc atc atg gag ctg gtt ggc ttc agc aat gag agc ctg ggc 949 Met Asp Thr Ile Met Glu Leu Val Gly Phe Ser Asn Glu Ser Leu Gly 160 165 170 175 agc ctg aac agc ctg gcc tgc ctc ctg gac cac acc acc ggc atc ctc 997 Ser Leu Asn Ser Leu Ala Cys Leu Leu Asp His Thr Thr Gly Ile Leu 180 185 190 aat gag cag ggt gag acc atc ttc atc ctg ggt gat gct ggg gtg ggc 1045 Asn Glu Gln Gly Glu Thr Ile Phe Ile Leu Gly Asp Ala Gly Val Gly 195 200 205 aag tcc atg ctg cta cag cgg ctg cag agc ctc tgg gcc acg ggc cgg 1093 Lys Ser Met Leu Leu Gln Arg Leu Gln Ser Leu Trp Ala Thr Gly Arg 210 215 220 cta gac gca ggg gtc aaa ttc ttc ttc cac ttt cgc tgc cgc atg ttc 1141 Leu Asp Ala Gly Val Lys Phe Phe Phe His Phe Arg Cys Arg Met Phe 225 230 235 agc tgc ttc aag gaa agt gac agg ctg tgt ctg cag gac ctg ctc ttc 1189 Ser Cys Phe Lys Glu Ser Asp Arg Leu Cys Leu Gln Asp Leu Leu Phe 240 245 250 255 aag cac tac tgc tac cca gag cgg gac ccc gag gag gtg ttt gcc ttc 1237 Lys His Tyr Cys Tyr Pro Glu Arg Asp Pro Glu Glu Val Phe Ala Phe 260 265 270 ctg ctg cgc ttc ccc cac gtg gcc ctc ttc acc ttc gat ggc ctg gac 1285 Leu Leu Arg Phe Pro His Val Ala Leu Phe Thr Phe Asp Gly Leu Asp 275 280 285 gag ctg cac tcg gac ttg gac ctg agc cgc gtg cct gac agc tcc tgc 1333 Glu Leu His Ser Asp Leu Asp Leu Ser Arg Val Pro Asp Ser Ser Cys 290 295 300 ccc tgg gag cct gcc cac ccc ctg gtc ttg ctg gcc aac ctg ctc agt 1381 Pro Trp Glu Pro Ala His Pro Leu Val Leu Leu Ala Asn Leu Leu Ser 305 310 315 ggg aag ctg ctc aag ggg gct agc aag ctg ctc aca gcc cgc aca ggc 1429 Gly Lys Leu Leu Lys Gly Ala Ser Lys Leu Leu Thr Ala Arg Thr Gly 320 325 330 335 atc gag gtc ccg cgc cag ttc ctg cgg aag aag gtg ctt ctc cgg ggc 1477 Ile Glu Val Pro Arg Gln Phe Leu Arg Lys Lys Val Leu Leu Arg Gly 340 345 350 ttc tcc ccc agc cac ctg cgc gcc tat gcc agg agg atg ttc ccc gag 1525 Phe Ser Pro Ser His Leu Arg Ala Tyr Ala Arg Arg Met Phe Pro Glu 355 360 365 cgg gcc ctg cag gac cgc ctg ctg agc cag ctg gag gcc aac ccc aac 1573 Arg Ala Leu Gln Asp Arg Leu Leu Ser Gln Leu Glu Ala Asn Pro Asn 370 375 380 ctc tgc agc ctg tgc tct gtg ccc ctc ttc tgc tgg atc atc ttc cgg 1621 Leu Cys Ser Leu Cys Ser Val Pro Leu Phe Cys Trp Ile Ile Phe Arg 385 390 395 tgc ttc cag cac ttc cgt gct gcc ttt gaa ggc tca cca cag ctg ccc 1669 Cys Phe Gln His Phe Arg Ala Ala Phe Glu Gly Ser Pro Gln Leu Pro 400 405 410 415 gac tgc acg atg acc ctg aca gat gtc ttc ctc ctg gtc act gag gtc 1717 Asp Cys Thr Met Thr Leu Thr Asp Val Phe Leu Leu Val Thr Glu Val 420 425 430 cat ctg aac agg atg cag ccc agc agc ctg gtg cag cgg aac aca cgc 1765 His Leu Asn Arg Met Gln Pro Ser Ser Leu Val Gln Arg Asn Thr Arg 435 440 445 agc cca gtg gag acc ctc cac gcc ggc cgg gac act ctg tgc tcg ctg 1813 Ser Pro Val Glu Thr Leu His Ala Gly Arg Asp Thr Leu Cys Ser Leu 450 455 460 ggg cag gtg gcc cac cgg ggc atg gag aag agc ctc ttt gtc ttc acc 1861 Gly Gln Val Ala His Arg Gly Met Glu Lys Ser Leu Phe Val Phe Thr 465 470 475 cag gag gag gtg cag gcc tcc ggg ctg cag gag aga gac atg cag ctg 1909 Gln Glu Glu Val Gln Ala Ser Gly Leu Gln Glu Arg Asp Met Gln Leu 480 485 490 495 ggc ttc ctg cgg gct ttg ccg gag ctg ggc ccc ggg ggt gac cag cag 1957 Gly Phe Leu Arg Ala Leu Pro Glu Leu Gly Pro Gly Gly Asp Gln Gln 500 505 510 tcc tat gag ttt ttc cac ctc acc ctc cag gcc ttc ttt aca gcc ttc 2005 Ser Tyr Glu Phe Phe His Leu Thr Leu Gln Ala Phe Phe Thr Ala Phe 515 520 525 ttc ctc gtg ctg gac gac agg gtg ggc act cag gag ctg ctc agg ttc 2053 Phe Leu Val Leu Asp Asp Arg Val Gly Thr Gln Glu Leu Leu Arg Phe 530 535 540 ttc cag gag tgg atg ccc cct gcg ggg gca gcg acc acg tcc tgc tat 2101 Phe Gln Glu Trp Met Pro Pro Ala Gly Ala Ala Thr Thr Ser Cys Tyr 545 550 555 cct ccc ttc ctc ccg ttc cag tgc ctg cag ggc agt ggt ccg gcg cgg 2149 Pro Pro Phe Leu Pro Phe Gln Cys Leu Gln Gly Ser Gly Pro Ala Arg 560 565 570 575 gaa gac ctc ttc aag aac aag gat cac ttc cag ttc acc aac ctc ttc 2197 Glu Asp Leu Phe Lys Asn Lys Asp His Phe Gln Phe Thr Asn Leu Phe 580 585 590 ctg tgc ggg ctg ttg tcc aaa gcc aaa cag aaa ctc ctg cgg cat ctg 2245 Leu Cys Gly Leu Leu Ser Lys Ala Lys Gln Lys Leu Leu Arg His Leu 595 600 605 gtg ccc gcg gca gcc ctg agg aga aag cgc aag gcc ctg tgg gca cac 2293 Val Pro Ala Ala Ala Leu Arg Arg Lys Arg Lys Ala Leu Trp Ala His 610 615 620 ctg ttt tcc agc ctg cgg ggc tac ctg aag agc ctg ccc cgc gtt cag 2341 Leu Phe Ser Ser Leu Arg Gly Tyr Leu Lys Ser Leu Pro Arg Val Gln 625 630 635 gtc gaa agc ttc aac cag gtg cag gcc atg ccc acg ttc atc tgg atg 2389 Val Glu Ser Phe Asn Gln Val Gln Ala Met Pro Thr Phe Ile Trp Met 640 645 650 655 ctg cgc tgc atc tac gag aca cag agc cag aag gtg ggg cag ctg gcg 2437 Leu Arg Cys Ile Tyr Glu Thr Gln Ser Gln Lys Val Gly Gln Leu Ala 660 665 670 gcc agg ggc atc tgc gcc aac tac ctc aag ctg acc tac tgc aac gcc 2485 Ala Arg Gly Ile Cys Ala Asn Tyr Leu Lys Leu Thr Tyr Cys Asn Ala 675 680 685 tgc tcg gcc gac tgc agc gcc ctc tcc ttc gtc ctg cat cac ttc ccc 2533 Cys Ser Ala Asp Cys Ser Ala Leu Ser Phe Val Leu His His Phe Pro 690 695 700 aag cgg ctg gcc cta gac cta gac aac aac aat ctc aac gac tac ggc 2581 Lys Arg Leu Ala Leu Asp Leu Asp Asn Asn Asn Leu Asn Asp Tyr Gly 705 710 715 gtg cgg gag ctg cag ccc tgc ttc agc cgc ctc act gtt ctc aga ctc 2629 Val Arg Glu Leu Gln Pro Cys Phe Ser Arg Leu Thr Val Leu Arg Leu 720 725 730 735 agc gta aac cag atc act gac ggt ggg gta aag gtg cta agc gaa gag 2677 Ser Val Asn Gln Ile Thr Asp Gly Gly Val Lys Val Leu Ser Glu Glu 740 745 750 ctg acc aaa tac aaa att gtg acc tat ttg ggt tta tac aac aac cag 2725 Leu Thr Lys Tyr Lys Ile Val Thr Tyr Leu Gly Leu Tyr Asn Asn Gln 755 760 765 atc acc gat gtc gga gcc agg tac gtc acc aaa atc ctg gat gaa tgc 2773 Ile Thr Asp Val Gly Ala Arg Tyr Val Thr Lys Ile Leu Asp Glu Cys 770 775 780 aaa ggc ctc acg cat ctt aaa ctg gga aaa aac aaa ata aca agt gaa 2821 Lys Gly Leu Thr His Leu Lys Leu Gly Lys Asn Lys Ile Thr Ser Glu 785 790 795 gga ggg aag tat ctc gcc ctg gct gtg aag aac agc aaa tca atc tct 2869 Gly Gly Lys Tyr Leu Ala Leu Ala Val Lys Asn Ser Lys Ser Ile Ser 800 805 810 815 gag gtt ggg atg tgg ggc aat caa gtt ggg gat gaa gga gca aaa gcc 2917 Glu Val Gly Met Trp Gly Asn Gln Val Gly Asp Glu Gly Ala Lys Ala 820 825 830 ttc gca gag gct ctg cgg aac cac ccc agc ttg acc acc ctg agt ctt 2965 Phe Ala Glu Ala Leu Arg Asn His Pro Ser Leu Thr Thr Leu Ser Leu 835 840 845 gcg tcc aac ggc atc tcc aca gaa gga gga aag agc ctt gcg agg gcc 3013 Ala Ser Asn Gly Ile Ser Thr Glu Gly Gly Lys Ser Leu Ala Arg Ala 850 855 860 ctg cag cag aac acg tct cta gaa ata ctg tgg ctg acc caa aat gaa 3061 Leu Gln Gln Asn Thr Ser Leu Glu Ile Leu Trp Leu Thr Gln Asn Glu 865 870 875 ctc aac gat gaa gtg gca gag agt ttg gca gaa atg ttg aaa gtc aac 3109 Leu Asn Asp Glu Val Ala Glu Ser Leu Ala Glu Met Leu Lys Val Asn 880 885 890 895 cag acg tta aag cat tta tgg ctt atc cag aat cag atc aca gct aag 3157 Gln Thr Leu Lys His Leu Trp Leu Ile Gln Asn Gln Ile Thr Ala Lys 900 905 910 ggg act gcc cag ctg gca gat gcg tta cag agc aac act ggc ata aca 3205 Gly Thr Ala Gln Leu Ala Asp Ala Leu Gln Ser Asn Thr Gly Ile Thr 915 920 925 gag att tgc cta aat gga aac ctg ata aaa cca gag gag gcc aaa gtc 3253 Glu Ile Cys Leu Asn Gly Asn Leu Ile Lys Pro Glu Glu Ala Lys Val 930 935 940 tat gaa gat gag aag cgg att atc tgt ttc tga gaggatgctt tcctgttcat 3306 Tyr Glu Asp Glu Lys Arg Ile Ile Cys Phe 945 950 ggggtttttg ccctggagcc tcagcagcaa atgccactct gggcagtctt ttgtgtcagt 3366 gtcttaaagg ggcctgcgca ggcgggacta tcaggagtcc actgcctcca tgatgcaagc 3426 cagcttcctg tgcagaaggt ctggtcggca aactccctaa gtacccgcta caattctgca 3486 gaaaaagaat gtgtcttgcg agctgttgta gttacagtaa atacactgtg aagagacttt 3546 attgcctatt ataattattt ttatctgaag ctagaggaat aaagctgtga gcaaacagag 3606 gaggccagcc tcacctcatt ccaacacctg ccatagggac caacgggagc gagttggtca 3666 ccgctctttt cattgaagag ttgaggatgt ggcacaaagt tggtgccaag cttcttgaat 3726 aaaacgtgtt tgatggatta gtattatacc tgaaatattt tcttccttct cagcactttc 3786 ccatgtattg atactggtcc cacttcacag ctggagacac cggagtatgt gcagtgtggg 3846 atttgactcc tccaaggttt tgtggaaagt taatgtcaag gaaaggatgc accacgggct 3906 tttaatttta atcctggagt ctcactgtct gctggcaaag atagagaatg ccctcagctc 3966 ttagctggtc taagaatgac gatgccttca aaatgctgct tccactcagg gcttctcctc 4026 tgctaggcta ccctcctcta gaaggctgag taccatgggc tacagtgtct ggccttggga 4086 agaagtgatt ctgtccctcc aaagaaatag ggcatggctt gcccctgtgg ccctggcatc 4146 caaatggctg cttttgtctc ccttacctcg tgaagagggg aagtctcttc ctgcctccca 4206 agcagctgaa gggtgactaa acgggcgcca agactcaggg gatcggctgg gaactgggcc 4266 agcagagcat gttggacacc ccccaccatg gtgggcttgt ggtggctgct ccatgagggt 4326 gggggtgata ctactagatc acttgtcctc ttgcccgctc atttgttaat aaaatactga 4386 aaac 4390 4 19 DNA Artificial Sequence PCR Primer 4 gcaggcggga ctatcagga 19 5 21 DNA Artificial Sequence PCR Primer 5 agtttgccga ccagaccttc t 21 6 24 DNA Artificial Sequence PCR Probe 6 tccactgcct ccatgatgca agcc 24 7 19 DNA Artificial Sequence PCR Primer 7 gaaggtgaag gtcggagtc 19 8 20 DNA Artificial Sequence PCR Primer 8 gaagatggtg atgggatttc 20 9 20 DNA Artificial Sequence PCR Probe 9 caagcttccc gttctcagcc 20 10 21580 DNA Homo sapiens exonintron junction (15377)...(15378) exon 1bintron 1b 10 tggccagggg ctcaccctct cgcaccgggc gtccctctgc gcgcagcttc tctcgccccc 60 ccgcgccaga cccgggcgaa tggcagcacc gtgggaccct gccttgaccg cccccgccct 120 tcggcggcct ctcccagcag ccggcaggct cttgggcgcg ccaacagagg ggcgcggctg 180 cggctgtagt cgcagccagt tcccgttccg ggcccgcgag gcagccgccc cggtcctgcc 240 cctccctcgc gctactgcgg gagcagcgtc ctcccgggcc acggcgcttc ccggccccgg 300 cgtccccgga ccatggcgct ctccgggctc tcctctagct ctcagcggct gcgaagtctg 360 taaacctggt ggccaagtaa gtcccagcga ctggggattc gcgcggggca ggccctttct 420 gaggtcctgg gcgctgcgag tgaggaggcg cagggaggcg ggatttgcgt gcgggcggaa 480 cgcagcgcgg ctctggagga gctctgggtg gaaccaagcg gagaaacccg cgagtttgaa 540 gcatgtagcg aaagttgaga gggatgaact tcacagtcag cggaatcgtt tatcccactg 600 tggtcgaacg caggggttct caatcgtgcc agcagcttag aaccacctgg ggagctttta 660 aaatccagat atccaggctg caccctagat caattccatc agaatctcac ggagtcagac 720 ccaggcctcc gtatctttta aaagctcccc aagtgattcc agtgtgcagc cagcgttcga 780 gggtttgtgg caaaggctgg aagggcagac aggggccttc atggagtccc gcctgcagac 840 gggacagcag ctcccagtgt cctgcttggt cctggagaga gggtgagaac ttcccttggg 900 tttcatgctc cacaaagtaa ggaaatgaga caatgcttgg caaggtcgcc tgaatatcac 960 attcaaaaac gcctccaatg tgtgcagttg ttttggcaca ttgtgaaaaa cataggaaat 1020 gacagaggtt gatgtctcat tagctctgca ttctaggaaa catttcggtt gttggtgttt 1080 gaaattaagt ctggggaagc taagctagta aacccatggc cttgatgact tctggccttt 1140 ctgctttaag ggtgaagcca gggccgggcg cggtggctca cgcctgtaat cccagcactt 1200 tgggaggcca aggcaggcgg atcacctgag gtcgagagtt cgagaccagc ctggcccaac 1260 atggtgaaac cgtctctact aaaaaaaaaa aaaaaaaaaa ttaaccgggc gtggtggcgc 1320 atgcctgtaa tcccagctac tcgggaggct gaagcaggag aattgcttga acccgggagg 1380 cagaggttgc agtgagccga gatcgccctg ttgcgctcca gcctgggcaa caagagcaaa 1440 actccgtcaa aaacaaacaa acaaacaaaa aacaaaaaaa cggtgaagcc agaagtcgtg 1500 cttgccaaag ggtcgagttt gttctccctc aaagcccctg ttgaagattg aactatcact 1560 ttcagggaag agtaaaagag taactccacg atgcatctta gagaggagtg gattccctgt 1620 tctcacccag gcttagatgc caggggccag gtagctgaaa tccaggcaaa ccaggcattg 1680 acaaagtaca gacttctacc gaatatgcca gacagataag caagctgtgt ttaaaacaaa 1740 cagcagagtg gtagaagagg gctctttcaa atattgtaag aagagtaggt tttatttttg 1800 tggagtggag aaataagttc acgctttgga acccatcaaa tctgggtcaa aactcggatt 1860 ctgtcacttc tatgctgtga ctttgggcaa gttccttgac ctctccaagc ccctgttttc 1920 ccacctgtaa aataaagagc aaccctctcc tgaggctagc atagttcagc gagatgatgt 1980 gcatcacaca cctggaaggt ggtggcgctg gcaggtgctc tggcagaggt agttattaga 2040 cactggagtg gcagtttgtg cccgttaatt gtttacttag tacagccact atgagggaga 2100 atgtctataa aatgttccct tcaacgctcc attccttctc ctgccctgct gccccgctca 2160 gggccttagc gggctttaag tctgaaatct taggctgcaa atcccctcag tcccccaggg 2220 cgtattaatg gctctggaat acagttttac tgcatgtgat aggaatgaat attttgacac 2280 acacacactc aaaagctgtt cttaatcttg agacttcctc ctctctgttg ccacatatat 2340 atcttttaaa tggctttcca agaaagctgg gctgtgcaca gtgtctcaga attagtgtat 2400 gtgcacacac acgcacgttg ctccctgctg acaacatcct cagaccatac ccttggtgtt 2460 cctgttttgc tcctgactct tggcctaggg acagtgaatc tttcttgaat gaccaaggac 2520 agtcagggat ctgaggaagg gagaacccaa attctacaat tgctgatgcc tgcgtcccag 2580 ctggaacttt accctggtac gtgctctgtg catcatggat ggtcactttg atcctaacag 2640 cagtgctgtt tagattagac atcagaaata gaaacagtaa gtaatatagg cataaagatc 2700 aagtgttttt ctagtgtatg tgtgatcact gatatttgct tacacaagac tatcaaagtc 2760 ttatctctta ttcaaagaaa aggattcaat atttgttcaa aatctaagat atgaacgtag 2820 gatttaaggt ttaagttact agtttcttct atcattctta ttattggggg gatttttttt 2880 aggcaagaag aaattagaaa agctaaaatg tgtacttatg ctgaattttt gtcatgcaga 2940 ggctctaggt attttgcaag ttttcagaag tcccattttc ttttatcttc tagggacagt 3000 cagttgcatg tcacacaagg ggcctccttt gatttatcca aaaggtctga cctggtttct 3060 ggaaaaactc tttagcgcat caaatactcg ttgagggcaa gtgttgtttc taaacttgac 3120 tcttgtggtg agaggtgagt agtcactggc atgtcagtta gctgacattt tttacccacg 3180 ctagaaagtt cattcaggca agacttcctg gcttgaactc ttatttaaac atgtgtttta 3240 tctggaactg ccacagtcca aactgtgggc agagattcct gtgcagagtt ggctaaggca 3300 ggcatgccac cggcccttct cttcgccatc ctccctgtct ctggagccat tgagccaacc 3360 ggggatgatg ccactctgaa gggatctcac cttggtcctc taatggggca gagttccctg 3420 atgagggatg cacgtcctta gagccaacaa ggcaattttg tattatactg catggaaatt 3480 aaaagttact ctccagtaaa ctgccacttt agaatttgtt ttactgaagc gtccttttcc 3540 tctaaattat agagcactgt agtaagggcc cagataattt caaagtgagc accagattag 3600 tgccagattt tggcagatta agattttaat aaaacatgaa cattcacctt gattacattt 3660 gtttcagaaa gttgatgtcc cttggccttc tgttctgtca ttgccaccat acaggttagg 3720 gtcagaattg ggagaatgta gtcatttggt agatattttc agagccacat gcaactgggc 3780 ctggcagtca catattgcta aggaaacgcc ttacatgata gcagagctga gagcagatga 3840 tgatttactt gggagagtat tcgttttcta attcccaaaa gattaaaagt ctaatgagta 3900 agaacttccc ctgcttcttg taacacatgc ccagattgct gcctatgacc tcaaaaccaa 3960 accttcttgg aaggaactgg gggaaagcta ggcttttttg gcaggggata tcctaactcc 4020 ttacaacaca ggccccttca gtgagtgggt catttctgga ctccatttaa caaagctagt 4080 gtttcatact taatatgaaa acaggcgtgt tgatgatttc aaatagagat gttgggtcta 4140 taaagtatac cagacatgca gggctgaaat ggaagaaaca atgggttttg gagtcagaac 4200 caggatccaa ttcctgtttt attccttact tcaggggtcc ccaatcccta ggccatggac 4260 caataccagt cggtggcctc ttaggtgggc cgcacagcag aaggtgagca gcagtgagca 4320 agcattaccg cctgaactct gcctcctgtc agatcagcgg tggcatcaga ttctcatgcg 4380 aggggaaccc tatggtgaac cgcacacgcg agggatctag gctgcacatt ccttatgaga 4440 atttaatgcc tgatgatctg aggtagacag tttcatccta aaacgatctc ccctgtgccc 4500 ctactcgccg ctgtctgtgg aaaaattgtc ttccatgaaa ccagttcctc gtgccaaaaa 4560 ggttggggat cactgcctta actaactgtg tgaccttcac caacatactt aaccactcca 4620 actctgtgtc ctcctttgta aactagttac aataatgctt tccttgctgg gcagttttca 4680 gtatttgatg agctaatcta tgtaaagtgc ttagcacatc acccagcttc tagtaagtca 4740 tgattatgat gataaagatt atgatgatta ctttggttcc agggctggtc tttccttctc 4800 ctagttgtgt gacttcagat aaggttttta atcttcttga gccttggctt ttgaatctgt 4860 aaaatgggct aactctacct atctggaaag attgctttga ggcttgattt gattaaatga 4920 aatgatgaca ggcctgatct ggtagaccct aaataagaat agcatttaca tagacttcct 4980 gtgtcccaca ctggtctaag agtttagggt ttttgttgtt tgtttttagc tcattttatc 5040 ttcccagcgc ccctgtgcag tggttatcat ttcccttgtt ttatagagga gacaactgag 5100 gctcaacaag attaaataac tagcccaaag tctatcccag gacttgaagc agagtccctg 5160 ttcttaacca ctgccccagc tgctcagtga gtacccccac ccctgcttat tctacctacc 5220 tcctgatgcc catctcctgc cattaacagt cagatgtgct tatgctctgt gtagggccat 5280 atttctggaa acagagcccc tgcttccaaa gagcagctta cttaggaagt gaaagtggca 5340 gctcactcct cctgtcgcct cctcccactt ccagataaca ttaaacctca tccattgctt 5400 aggctggtta atatcaactg gaaaaaaaaa tctgataccg atattgacta acattattag 5460 agtgggttgc aatgttagtt ttgagataaa tgtcatctcg atgcactgag attgttagcc 5520 tgtgtgtaat aaaacatgcc ttcacccaga cagatgtggg gggtgtttgg aagacagaaa 5580 ccaggcctag gagatcccag gttgagactc aggtggacag caacagtatt gaaaataata 5640 ttacatcaaa ggtgcagtgg tctgttctaa gtacaccacc ttttctgaca ttccttactc 5700 atcctccctc tcctttaccc cagtggcttt ggcctcttct tgagttctgg caatcttccc 5760 tgaggtctgg gtcagtggaa tttgggggcc cagaaaggca aattctcagt tgagtgaggt 5820 gtggcttctt tagcattcct gataatgagg tagtaattgc ttttctggct ttctgtgtgt 5880 gtgaagtgct ctgctggccc cctaccctgg tgtggaaggc tgttagaaag ctatttatag 5940 ctcttggttc cttctggcac tttggggctt caattctgca acatgttaat tcaacaaata 6000 ttatcagaaa gtcttgctgg tgccaggagt ctactaagta cttggattac agcaatgaat 6060 gaaacagacc ctgcctctgg ccatatggaa tttggaggct gccgggtgga gaggatgtct 6120 tagtcagctt gggttgctaa aactatcaca gactggggga cttaagcaac aaacatttat 6180 ctctcacagt tctgcaggct ggaagtgtga gacctggttg ccagcatggg agtttggtga 6240 ggaccctctt cctgatttgt gaaagaggtt tctcatgtct cttgctataa gggcactaat 6300 cccatcattg gggctccacc cccacgacct catttaaacc tattcacctc ccaacagccc 6360 tatctgtaaa caccatcaaa tcgggggttc tccctcacca tgtgaacttt ggagggacat 6420 aaacatcagt ccataagtag gaggttaggg cagtaagcac acaggtaaat acataattgt 6480 aacttgtgga aagtgcttta agtgaaaaca acagcatgtc atgagacagg agatcagagc 6540 agaactagaa ttaggagagt ctggggacgc ctctcaagga atgagccttt tcaactagga 6600 cttgggatga aaaaaagagt gaaatcatgc tggatgcaga gaacagtggg ttcaaagacc 6660 cagaggtgaa tgggaaggat gtgggtgtgt tgagtggctg gtgtggtaag gcccaagccc 6720 aggagggtac agagctaaat aatgagacac actcaggcca agtagggcct cctgggctgg 6780 gtaaggattt gggattttat tctgaagtgc ggagggaagc tttggaggaa tttcccctcc 6840 tgactttttc ctgtcattca taatatacaa gaaaaagatc attagtcact tctatttatt 6900 cagataatga acaatctcta cttcttaagc tatataaacc ttgtcttagt gtgtgaaaca 6960 attctggagt gtctagtttt caatctccag atgagggaac tgaggctcaa aacaaagtga 7020 aagaaggcag aggcaggatt tggaaccagg cctgcccagc ccacggcaca tgctctgtag 7080 ttttctggga cagatgagtg taaatgcata tgggcctcct aggcacccgg catccttcat 7140 caggagcttc agtggatgga ctagggtgga gtccagtcct tcctggctct ttagaaacag 7200 gaacaaaaga ccccgaagag tggagggagg ttgggcatgt acaggtcagg gggtcagaat 7260 ctgtaggcac caaatgtcag ggccccctgg ctatgggaag ggctgtcagt ttttcacgct 7320 agtctgttct gctccgccta gctgttgccc agaaatccca aagagggtgc cttcccagct 7380 ctgttcacct ttgaaaatct gaattcatca tttgtgtgca cttgagagac ccagggtttg 7440 gtaaaaacaa aattctttat tgggtgtttt caggtcacag gaaaagcctc cactgtgtgg 7500 gacccacatc cttagcactg catgctggct gcttccaggc atctgcactg accctgtttc 7560 tccttccttg ctgtgaagcg gtcatgaggt tgcagtgcca gatgtgttgc aggccaggtg 7620 tatctcggct ggttgggcag gccttctgct taacatggga cctcacttag tgagggaaga 7680 gagcagtcct ggcccaaggg gtgcagtgtc tggggcagat gctttagggg ttatggctct 7740 atcatcttct caacctctta atgccactca gggtcaccac acataaatcc ctgggtactc 7800 tcccacttag ctctgtgaca ggtttgtgtc ctagatgctt gctgactctg tatacatgtt 7860 caatggggtt aataaacgtc ttcagctaag cattttggcc agcgatcagc agagtgttta 7920 tttgctgtgg gcccggtttc aactcccact ttggtggtga ctaatatatt ttgttcttgt 7980 ggagtgcttt tcacttgata agccccagac actttctgtg tcagtcctaa atgccatatc 8040 cagatgtcct ccagcttcca gatcggccac ttctgtggca ctttcctctg cataaacact 8100 tcagccctct cgcagagtgg agttccagga aaccggtttc cacttcaagg ctctgttgtt 8160 ttctgctggc tttcttcatt aagatatttt tattcactgg cccctcagag atgaatatca 8220 agtgcagtgt tttgggaact ttgtttttca aagcctggat aaaaccaaga aaatgtcaag 8280 cagtaggtga ttaagtacca agtggtcgca caggcaatgc ttttgacatt gacagctggg 8340 ctctggttaa ccttttaaag acagcagcga agataaatgc cttagagcaa cagaaactat 8400 tggtgatgga aagttgaatt tattcaccaa taaacttggt aagtgacatt ccctaagtga 8460 gcctgtgaat tatggaaatc cacgtatcag tggacatgat gaagcacctg ctatacgcag 8520 tgcacccggc cagattcatt cattcgacgg ttcctatcga gcaccctcta tgtgctggag 8580 atacggtgac tcaccagatg gtcgccctcc cttgtcatca gtgctggcct tgaagatgac 8640 agatggaaat gctggcctag gctgccctct tgtaggtcac ggccggtgct tactatgtga 8700 ccagtgctgt tctaaatacc gtatgtacgt gtattaactc actgcatgct cacagagatc 8760 cctacatgga gtaggtgcag ttattatctg tacttacaga gaagaaaaca gaagcacaga 8820 agagagcatt atttgtccaa gtcacatagc tggtgattgg aaccagatag cctggcttta 8880 gagtctgtgt ccttcactgc cactaccaac caatgtgaaa ggaaaataat caacttagga 8940 cacaaagttg cacgccctct gagagaggcg gggatagaat accatgggcc ttcccagcag 9000 ggagagatgg gcttagggga gaaacagagt aaggtgggaa ccctaccctt ttcccattct 9060 gacaggtgac atccccgctg gggggatgga ggaaatgttt ggctactgag gaggctactc 9120 tggccagaag tagacaaaag aagcaactgt ctcgcagcca ggagacttgg atgctagttt 9180 tgattgatcc ggacaagtcc agggccctgc ccagggcctc tgttttccca tctctaaaat 9240 aggtggcttg gacaacattc tgtttgacct caggagccca ggatttcagc agagaaggga 9300 ggagctggct ctctgtgcac tctgaacagg gtttcctgac cagaagaaga tgtttggctc 9360 tgagagatgt gggggcaggc attggacttc ccagaaggcc ttggcagctg tgaaaggatg 9420 tatgcgttcc actgtggtcc caccccgaat gctgacgttg gctgtgtctg cattcctttc 9480 tttgtaggca ggatggcagt gtctgtgtgt ctggagctca cactataagg tactttgtat 9540 gagagacaga gggagctaga gggagggaag agggagtagg gaggggaagg aagccgggag 9600 agagagaggt gattccggga gggactgact tccatctgct ttcaaattct gagggattaa 9660 gtgctttcag atacttaaag tcagtgagtg tcaatggagt taaaaagctt ccaataataa 9720 ttgtagcttc atccatctgc taaattgtct ccagagtcgg ctgcctatag tgttctggca 9780 tggcagaggc ccacatgacc gggcttgttt tccaggttaa gttccttaag aaattgtaaa 9840 ccaagcaaga aaaacaaaca gacaacaaca gaacacagat tgttgtggat catgcagcag 9900 gaaaagtagg acaccatttg agctgaccaa gcatatgtgt tctccgtgca gctctgtcga 9960 agctgtgtaa ggtgtgcttg gtgctggctc ctgccagatg ctgttggcat ggcttcatcc 10020 acccagcttg ctgttgggta acttgcagag tgaagaagtc tgttgttcac gtcatcacca 10080 caaccctgaa catccaagca gttcatgtca acaactagag tataattatc ttctttactt 10140 gtaaaactgt ggagcctgtg gattaaattc accaaaatga atcatattca agagaagtca 10200 tttcctaaag gagaattcag atgtggtccc cattcatggg tatatgtgtg ccagttatgt 10260 tgggggttgg tggagcccac tctaattgac atttgcaatc tgggttatat cacctctgag 10320 gtaggcaggt acctacatga ggtaggcagg gaatgatggc tttaacttag ttcattcaat 10380 aaatgaatat cagttatcta ttcctattgt ccaggatggt ggtgggcaga atgtctgaga 10440 agcccatgtt gtgtatggca ggttgtgact ttaccagaga tggcattttc tggttaaact 10500 tgggtgatag gatgtgtttt taacagaaag ggaaaaagaa tctgaactag tttccttggt 10560 aataataacc atttgggttg gggagctgat actggttagg aaaatgtgag gcccctttga 10620 catcatccat ttgctgctga gaaatcaaaa ttaaaacatc actcggtgtc agaatgttcc 10680 agaaggcagt tgagctgcta atgttttgaa gtgttcagag gtatgtgttt tatgaaaaaa 10740 agaggaaata caagacaaag gaaatgaaac gtttggcaac cattgagcat ttcttcaata 10800 tggtatttct tgtggctctg ggattttaaa aatccacttg aaagccagta ttgttgactt 10860 ttgctctgta ctatttttgt ggggactgat tatttttcac tttatattcc ttatttctaa 10920 ctaggcttaa tggtaatacc ctaagttgat ggccttctac taaatttaaa acaaaatata 10980 ggccgggtgc ggtggctcac tcctgtaatc ctagcacttt gggaggtcga ggcaggtgga 11040 tcacctgagg tcaggagttc aagaccagcc tggccaacgt ggtgaaaccc cgtctctact 11100 aaaaatacaa aaattagccg ggtgtggtgg caggtgcctg taatcccagc tactcaggag 11160 gctgaggcag gagaatcact tgaacctatg aggcggaggt tgcagtgagc caagatcacg 11220 ccatcgcact ccagcctggg tgacaagagt gaaactctgt ctcaaaacaa acaaacaaac 11280 aaaaataaca acaacaacaa aaaaaactgt tcccacccac aatcccatca gatttattgt 11340 gatactttca caaaacagga agagcttttt attgatttac aggtgcacag aaagtcttaa 11400 catctcatct gagtttgttt ggtcttaatg aaccatcgtt tagtttaatg aagaccaagc 11460 attttctctc tagggcaggt gggccctttg agttgaaagt catagttcct caaaacaagg 11520 cagtcgctga gaatgttgtg ctggcctgcc tcaagatgtc ctgagatgcc ctaatgagga 11580 aaaggcccta aaatatttca atcatgcaag tgtatagcct tcttataaaa aaaaaaacat 11640 tacgaataaa agtcctaccc tactcctggt gtcctctccc tttccagaag taattttcat 11700 tctcacaccg tgtgtcttct cacaggtggt tttctttgca tttacattca cgggcgtatg 11760 cctatagaga acgtggagtg ctttgtgtat ttggagggca ggggatgtgg gcaagccata 11820 taaatgttat tatactctat atttctttct gcaacttaca tttttcaagt gacaggatgt 11880 cttaaggctc tttctatgtc agtacatgca gctccaccgt gttctttttc atcactggtc 11940 ggcttggctt ttagggatac caacccagta attctaacag aaatgatgga tgtggttctg 12000 tggagccaag tgaggggtgg agtgaggtgg caggacccaa cacaagttgg gagaaggtga 12060 atgatgtaac ccaaggacaa ataaagaagg tgcatgtcct gcctattttt cttcctagta 12120 aagcagtatc tccctatata gattactgct gagtctgagg tagacaggga atggtgttat 12180 attttatctt aaatcttaat tccataaata gttatagtat ctgaaaaagg ttgtagtgac 12240 ttttctgtgc taagagggat cctttaatgt agtgggctca gacaacgaag gttttttgtt 12300 ttgtttattt tttaagaaac aggggtctca cttgtcaccc aggctggagt acagtggtgt 12360 gatcatagct cactgtagcc tcgaactctg aactcctggg ctcaagtgat cctcctgcct 12420 cggcctcctg agtcattagg acagtaggcg tgcaccacca tgcctggcta tcaaacaatc 12480 aaggttgatt tcttcctcat atcttatgtc ctttgcgagt tggcaagagc tactcattgt 12540 ggttatttgt gggcctgagc tgatgcagct gcccccatct tgtgctaatc atagtgccag 12600 aagcaaaaaa aaaaaaattg cacattgata aattaagcag tgatggctaa aagcctccta 12660 cccaaagcaa cacattctgt ttctgctcat gtttcatgag ttaaagcctg taacttcaaa 12720 ggggtaggga agtgtaatcc taccaggtgt ctaggagaag aacctgaact gtttgatgaa 12780 ctgaatactt ctaccactgt gttggttttc aaggaagggg tgggaaagtc tttgaaaact 12840 ctcctgtgac ccataaagtt atatcctaga agccaatcct ttctgtgttc ataaaatcac 12900 tggccttttc ctgtggccgc caaggttgca gagagcagag ctgtttggga actcacttca 12960 agaagtggtc agagcttgag gagggagggt caggacagga gcagaggcag atggcatggc 13020 actgggccag gttccatatg acctgggtgc cagcaaagct ggccttggtt ttgcttatgc 13080 ttgtatttgt taccaaccta tatagcaagt atcacagtag aaaaacttac agaatggcct 13140 gactacacag agctcactgt tgaaaaacgc tgggttgtaa ctctaagata gcatctgcta 13200 cattgctaaa gaatgtttta taacagggct tagatctgtt aggaatctta gctgctactg 13260 ttgcaacacc aatttatgga aagctgtgtt atttattttg aaatataaac atgaaaaaaa 13320 aaaagagcga ataatgattc ccaacaattg ggtgcctgaa gaaagagtga aatcatgagg 13380 ccagtgttga tgatgggaaa tgactttctt gaggtttctg ttctcaatct ggccccatca 13440 gttgggatgg agtctagtct tgtctgctca tgctgggagg aaggcgctca tagtcacaat 13500 ggagcaggga aacctctgct gggtgattct gcacagcatg aagctcctgc taaatggatc 13560 atgtttgcta gtgtttttta ggctgcagag aacaggggca cactgaggct atgttcatgg 13620 gggtttatta taaagataca caaaataagg gaagccgtgg ctggtggcct cagctgtggg 13680 cagccaaacc cacacctcct gtgggttttt gggactcagc atctctctaa atgtctcaaa 13740 ttcaaactcc ctgagagagg agctctaact gggtctccct gtcacctgcc attacagagg 13800 gctgcagggg agcagacacg tctgtgatca tgctggctgg agcccaagga agcccttgag 13860 agccacggca cccccctgtc tctctctgct catttctgct tccacacatt caccatactt 13920 gtgattcctt gttctgtgtc tgtctgtcct gcccagttgt aaactctgtg agggcaggga 13980 gctacttgac ctctgtgtgc ctcagtttcc tcatgtgtaa attggcggta acaatgaccc 14040 ctggcaccta gggttgttgc aaggattgag ataacagaaa ttagtaaatg attcgctcat 14100 tactgagtgc cagccctggc ctaggcattt tgtgcatatt aactcactta atccttacag 14160 caacattttg aggtggaact cttaccttga ggcaatcacg gcacagaaag gctggctaac 14220 ttgctaacaa gtgatggtcc ttataatcag tgatcttata ataaagtccc atgaacagac 14280 ataaactgag agtgcccctg ttccccacag cctaaaatca ccagcaaaca tggtccattt 14340 agcaggaact tccctctaat ttgctcatga gtcagaattt gaacactgct gtctctagag 14400 atcccagaca ccacagattt cacccttatg ttgtcctgcc tctcaccatg tgtataaagc 14460 cttaacatcc atgctgatgc cctagtacaa cacttggtgt tagctattat cagactaggc 14520 cctgctgggg cccttgcacg ttgcagggcc ggacatgagg aagtgctcag tggatctctg 14580 tagaaggaat gaacagctgc aacctcatgg agttgtccca ccactctctg cccctgccca 14640 cccccttgca gctatcccac gggtcacgct gatggtagag cactgtcatg tgcctcgcag 14700 ctggcagagg ctgctgggca aggtgtccag gtttgcttta gcagttcagg caggcatggc 14760 cgggccgagg aaagggagac catttgccat tatctttgag cctccctttg gcacgtggaa 14820 tttctcagat gcagagcttt aaagcagcag agtggcaggt gagttcttac ccttctttat 14880 aagagaaaaa ccagggaaag caaaacattc tgctttctag aagccaagtc atccagaagc 14940 cagcttctgc agctccaaag agggccttag cagttacaaa gaaacacaag ataacggtaa 15000 gagaacaaca aaaacatggt gtaatgccaa acatgcaaaa agtcagtgtg ggcgcaaatg 15060 catgcagaga aaagctgaaa ggaagtgctc caaaatgaga atagtggtta ttgcatgtgg 15120 ataataggga acttaattta ttcctctaat tttttttttt gcatttccca agttttctac 15180 agtgagcagg tattgtttta tacttacaga aaaagttatt aaaacataga aaggtacatg 15240 tggagaacat ctttataaaa tgtttttttt tttttaatct tagcatctca gttataagaa 15300 atgcacatct ccgctcaagg tcccctggga cagtgcaagg aaagaaagaa gcctgcggtg 15360 aaagtgacac tgagtgagtg tgtatctgtg gtttctctgg ggccacccca ggattcaggg 15420 acagagtcct ggttgctctc caggaggtga gactcctttc taggtcaatc atttcctgag 15480 gggtcctggt gacctcacga gatattaata taatatagtt cggttagtac ccatttctgt 15540 ttcccagctt atgaacactt cctgttgctc atcatttttg tttttgtctt gggatcattt 15600 tttttctacc tgaaaaactc tatttagtgt tcattttagt gcagatacgg tagcaacaaa 15660 gtctcagttt ttgtctggaa aaactttatt ttgcctttcc ttaaaagaaa atcaatttca 15720 tggaagtgta atttcataca ataacattca cagatttcaa gcatatgggt tgctgacttt 15780 tgacaagtcg tatgtatcta tatgtatatt catatatata catacacaca catatgtgac 15840 tgtatacaag ggagagagca agcatgcaat caagacacag aacatttcta tctccccaga 15900 aattttcctt attcctcttt gcagtcaatt tctcccccac tcccatctgc cccctgcctc 15960 aggaaaccat tgatctgatt tctagatcag attttcagaa tcatatgcat gaaatctttt 16020 tttttttttt tttgagatgg agtctcactc tgtcacccag gctggagtgc aatggtacga 16080 tcttggctcg ctgcaacctc cgcctcccgg attcaaccaa ttctcctgcc tcagcctccc 16140 tagtagctgg gattacaggt gtgcaccacg atgccctgct aatttttgta tttttcgtgg 16200 agacagggtt tcacaatgtt ggccaggctg gtctcgaact cctggcctca agtgatccac 16260 ctgccttggc ctcccaaagt gctaggatta caagcgtgag ccactgcacc cagccctaag 16320 tactcttttg agtctggctt ctctcactca gtgtagtatc tgtgacactc attcatgatg 16380 gattcttgtt tcattggcat ttattgttga attgtgttct actgtatgaa tatgctatac 16440 tttgtcaatc cattcaccag cagttgtcct ctgtgtgagc tacaagttat tctattatag 16500 ccaggactgg aagtcccttg tttaaaaaga aaaaacagca aaaaaaaaaa aaaaacccat 16560 cagaacccct gattcttttc aaaaggcggg tttctagaag gggaagttac taaatagaaa 16620 gttttcagag tctgttgaca catattgtgt caaaggactt tccagagact tccacagatt 16680 tacatttcca ccagtggtat aggagagtcc tgactcacca aacatcaaca gcatttttgt 16740 atcttactct ctatctttgc aaggatgcaa agctgcaaaa ttgtatgcag cagtgaagag 16800 aattttacat tgagtataaa ccagtaggtc cagtgtgtat tttcttcctg agtttgcaga 16860 attttcagtg acctcacgtc tttaacctct gtctcactga gttgccgatg aaactgatct 16920 gaaaagccaa ttaagatcat tcctggccag gcacggtggc tcacacctgt aatcccagca 16980 ctttgggagg ccgaggtggg tggatcacga ggtcaggaga tcaagaccat cctggctaac 17040 acggtgaaac cgcgtcccta ctaaaaatac aaaaaattag ctgggcatgg tggcaggtgc 17100 ctgtagtccc agctacttgg gaggctgagg caggagaatg gcgtgaaccc gggaggcgga 17160 gcttgcagtg ggccaagatt gtgccactgc actcccgcct gggtgacaga gcgagactcc 17220 gtctcaacaa caacaacaac aacaacaaag atcattcctg gcatcagtat ttcaaatgga 17280 agggcagcag gcaggaactc tgatgtctca acaccctagc ttttagtgat tgccctggag 17340 tggcactgtc cacttgagta gcccctggcc gcatgtggct cttgagcact ggaaatgtgg 17400 ttgatccaaa ctgagatgtg cgattaaaac acatgccagg tttccaagcc atagtatgaa 17460 gaaaaagaat gtaaactatc tcattaataa attttttgtt gatgggctgg gtgcagtggc 17520 tcacacctgt aatcccagca ctttgggagg ctgaggtggg aggatcactt gaggccagga 17580 gttcaagacc agcctgggca atatgatgaa accctgtctc tactaaaaat acaaaaagta 17640 gccaggcatg ggggtgtgca cctgtaatcc cagctactta ggaggcggag gcaggagaat 17700 tgcttgaacc tgggaggcgg aggctgcagt gagccgagat tgcgccaccg cactccagcc 17760 tgggcgacag agtgagtgag acttcatctc aaaaaataat aataaataaa ttttgtatgg 17820 atgacatact gaaatatttt gagtgtatta gaattaaaat atatcattga aattaatttc 17880 acctgtccat ttttactttt attagtgtgg ctattagaaa gttttaaatt actaggaaag 17940 ctgtgtcata tttcaattca acagagctgc tctatggtct cctttttccc tttgaagatc 18000 cgccatcctg caatccctgc ttccttttca ccagagcagc ttccctgaat cttctcctgg 18060 aggctctgca gaccttttct ttagtttgga agatcacatt gcaggagggg acttgggcac 18120 tggtttctgt gaggagcccg gagtggtgaa ttgcccctgg gattccctta ccctggaaac 18180 taaccctctc tgagggcaga aagctagaaa gaagagcggt tgagaggaaa tgcctgtgca 18240 acccccagct cccttcgggg ctccctcact gcccacaggc tctaccccca cctgcctttg 18300 cctcaggaga tggcttttgg tggtggactc accctcctcc gatctcctct gcttcatttt 18360 tccacttggt gatccatctt ttgttctgcg gtgtgtcctg ttttctggct gcctccatta 18420 gcttggtttt ttcctttggg tatccctgga gctgtcttac caggatctcc aacttcagtc 18480 cccattggtc tgcaccactg gcctgggaaa gctccaccca ggagagcaga cccagctccc 18540 agatacctgg ccccagccca atctccgtcc tctctttgcc tggaagagag gaccagaccg 18600 tcttcatcaa ctggacccac cctttaccaa gcaaagaaag gaaaggattg ccccccaggg 18660 ccagcagatc tctggctgtc tggtgtttct ggtaataagt gcccatgaac ttcgaattga 18720 tctccagtct cccatggtat ggttttggtc tacccttaga ttctgtttac caaggcagag 18780 ttcagtttct ctgctttccc ttccaacatc cataccttgt taagttcttg ctgaattgag 18840 ctaactctgc acacctgatt aaatcttcag ccagggctct ggacattata aagcagcctc 18900 cttgccagat ggctgtcata taatatattg ctttatttag gtgattgtaa gtcaggagac 18960 tttccttcgg tttctgcctt tgatggcaag aggtggagat tgtggcggcg attacagaga 19020 acgtctggga agacaagttg ctgtttttat gggaatcgca ggcttggaag agacagaagc 19080 agtgagtaaa acgggcccct cgtggtaggc gggcaaaggt cgggaaagga gggatgaagg 19140 aagctgtgca acaccccttc ccagctttct aaagaatgga gcatggcatt gcaaaatgct 19200 gaatcacaaa gtgagaagtg acttctttcc agttttctct cagccttgtg atgatcttaa 19260 agagaaagtc tcaattctgt gctactgtgt ctttaacatc tctcaaatgc ttccgagaga 19320 aaacaggccc caaccctgga gcctttccag gcagcaggac tagctggaat acagtaacat 19380 tgtggtattt ctggtgaatt aatttttgtg ttactttcta tgtattgcaa aggatatttt 19440 ttttctgttt ctaatggtga cattacattc ctttaaaatt attagagttt tcaaaactca 19500 attgaaagca aaaggttaag caaataaagg acaggtgtga ctcaattatg gcaagaacaa 19560 caaaaaagtg actatggtgg ggacgttgaa catttagaaa acactgtcct aaaagaagaa 19620 ggtacagaga gtagtctgat acctgggagt actagatttt aaaattattc tcttgccaat 19680 tttattcatt cattcaccat gtgccacaat aattttttta agtctttctt tgattttgca 19740 gattccagaa ataaattgga aattgaagat ttaaacaatg ttgttttaaa atattctaac 19800 ttcaaagaat gatgccagaa acttaaaaag gtattattaa ctgctaattt aaatttaata 19860 ttgtcagctg gtatgcttta aatgaaccga ttttctaaag ctaaggatcc taaagtgggt 19920 catagaattg actgccatgg aaaatagccg ggtgtggtgg ctcacacctg taatccaagc 19980 actttgggag gccgaggtgg gcagatcact tgaggtcagg agttcaagac cagcttggcc 20040 aacgtggtga aaccctgtct ctactaaaaa tacaaaaatt agctgggcat ggtggcaccc 20100 gcctgtagtt ctagctactc gggtggctga tgcaggagaa ttgcttgagc tcaggaggca 20160 gatgttgcag tgagccgaga ttgtgccact gcactccagc ctgagccaca cagtgggact 20220 ctgtcaaaaa aaaaaaaaaa agaaaaggct ggattagtat ttaataactt ttattggaac 20280 aataactaat attttcatac ctttaaattt ttttggtgag gatctcttgc ccttctgttg 20340 ctgataattt taaattttac tctttcctgc tctatcattt ggattctaaa aagaaagcta 20400 ttgtgtgggg tctggcattt gataggttaa aaaaagaaaa catagaagca tgtcaaagag 20460 caggagactc atgtgccact tggtggaaaa aaaattgaga accactggac ctgaatgtta 20520 acagatgtgg cttgtgaaac atattaatga tgtggatagt gttaaaagat ctatgagctt 20580 ttttttgaac tacaaaaaaa cctttttttt actctctctc tgaaatgcat gagttttcct 20640 gcctagacaa acgacaaagt ggccaattcc aggcctttcc acttccaaga tcattttaca 20700 cacgcaagta ttcctggaca agaagaggct gaaggatctt ctgtgggatt tttttaaaaa 20760 attattattt tttgctgtta ttgttacaag gaaggtagtg agtgcaattt tgggataact 20820 cagatgaacc caaatgtttt ttaatgccca aataaaatag cctgggtccc tgttcctacc 20880 cagacaagaa actcagcact tgtaccaggt tttcaagggt ttccagagag atacctctct 20940 tgggaatcta atcccaggaa ctctgtgccg gctctccctc ttcctcctgc ccctctgccc 21000 tccctctgtc ccacttcatg ccacatgggc tattgctttt ccctcctttc ccagggtgct 21060 tgtgtataac agactttgct gaaggtcaag gacacggggg aagaggttat agcacaaccc 21120 aaaatgggca acagtgactg agacatcacc gtccgcaggt tccttgcctc tggctaccac 21180 aggagacttc atgctgctgt ctctcttctc aggctatttt gtgctgagac cgtctcccac 21240 caaaccccac aggtctttct ttctcagttc tgggcctgtt ccctgcttgg ggaatgcaga 21300 aacctccaag ccctgttaac cttattatgt aaaataaaca cctttccatc tctatcccag 21360 tataggatca aagcacactt ttgctttata gtaataataa agatgttttg atttaaaaat 21420 aaaaaagagt tggctatctc tggtcaccta ccctagagat gtaaccctca agtgcaaaca 21480 attaaattat gatttttttt ctttaatgca tctaagataa aagttttttt agagacaggg 21540 ccttgctctg tcacccaggc tggagtgcag tggtgtgatc 21580 11 4610 DNA Homo sapiens exonexon junction (2769)...(2770) exon 7exon 8b 11 ccctcgcgct actgcgggag cagcgtcctc ccgggccacg gcgcttcccg gccccggcgt 60 ccccggacca tggcgctctc cgggctcttc tctagctctc agcggctgcg aagtctgtaa 120 acctggtggc caagtgattg taagtcagga gactttcctt cggtttctgc ctttgatggc 180 aagaggtgga gattgtggcg gcgattacag aaaacatctg ggaagacaag ttgctgtttt 240 tatgggaatc gcaggcttgg aagagacaga agcaattcca gaaataaatt ggaaattgaa 300 gatttaaaca atgttgtttt aaaatattct aacttcaaag aatgatgcca gaaacttaaa 360 aaggggctgc gcagagtagc aggggccctg gagggcgcgg cctgaatcct gattgccctt 420 ctgctgagag gacacacgca gctgaagatg aatttgggaa aagtagccgc ttgctacttt 480 aactatggaa gagcagggcc acagtgagat ggaaataatc ccatcagagt ctcaccccca 540 cattcaatta ctgaaaagca atcgggaact tctggtcact cacatccgca atactcagtg 600 tctggtggac aacttgctga agaatgacta cttctcggcc gaagatgcgg agattgtgtg 660 tgcctgcccc acccagcctg acaaggtccg caaaattctg gacctggtac agagcaaggg 720 cgaggaggtg tccgagttct tcctctactt gctccagcaa ctcgcagatg cctacgtgga 780 cctcaggcct tggctgctgg agatcggctt ctccccttcc ctgctcactc agagcaaagt 840 cgtggtcaac actgacccag tgagcaggta tacccagcag ctgcgacacc atctgggccg 900 tgactccagg ttcgtgctgt gctatgccca gaaggaggag ctgctgctgg aggagatcta 960 catggacacc atcatggagc tggttggctt cagcaatgag agcctgggca gcctgaacag 1020 cctggcctgc ctcctggacc acaccaccgg catcctcaat gagcagggtg agaccatctt 1080 catcctgggt gatgctgggg tgggcaagtc catgctgcta cagcggctgc agagcctctg 1140 ggccacgggc cggctagacg caggggtcaa attcttcttc cactttcgct gccgcatgtt 1200 cagctgcttc aaggaaagtg acaggctgtg tctgcaggac ctgctcttca agcactactg 1260 ctacccagag cgggaccccg aggaggtgtt tgccttcctg ctgcgcttcc cccacgtggc 1320 cctcttcacc tttgatggcc tggacgagct gcactcggac ttggacctga gccgcgtgcc 1380 tgacagctcc tgcccctggg agcctgccca ccccctggtc ttgctggcca acctgctcag 1440 tgggaagctg ctcaaggggg ctagcaagct gctcacagcc cgcacaggca tcgaggtccc 1500 gcgccagttc ctgcggaaga aggtgcttct ccggggcttc tcccccagcc acctgcgcgc 1560 ctatgccagg aggatgttcc ccgagcgggc cctgcaggac cgcctgctga gccagctgga 1620 ggccaacccc aacctctgca gcctgtgctc tgtgcccctc ttctgctgga tcatcttccg 1680 gtgcttccag cacttccgtg ctgcctttga aggctcacca cagctgcccg actgcacgat 1740 gaccctgaca gatgtcttcc tcctggtcac tgaggtccat ctgaacagga tgcagcccag 1800 cagcctggtg cagcggaaca cacgcagccc agtggagacc ctccacgccg gccgggacac 1860 tctgtgctcg ctggggcagg tggcccaccg gggcatggag aagagcctct ttgtcttcac 1920 ccaggaggag gtgcaggcct ccgggctgca ggagagagac atgcagctgg gcttcctgcg 1980 ggctttgccg gagctgggcc ccgggggtga ccagcagtcc tatgagtttt tccacctcac 2040 cctccaggcc ttctttacag ccttcttcct cgtgctggac gacagggtgg gcactcagga 2100 gctgctcagg ttcttccagg agtggatgcc ccctgcgggg gcagcgacca cgtcctgcta 2160 tcctcccttc ctcccgttcc agtgcctgca gggcagtggt ccggcgcggg aagacctctt 2220 caagaacaag gatcacttcc agttcaccaa cctcttcctg tgcgggctgt tgtccaaagc 2280 caaacagaaa ctcctgcggc atctggtgcc cgcggcagcc ctgaggagaa agcgcaaggc 2340 cctgtgggca cacctgtttt ccagcctgcg gggctacctg aagagcctgc cccgcgttca 2400 ggtcgaaagc ttcaaccagg tgcaggccat gcccacgttc atctggatgc tgcgctgcat 2460 ctacgagaca cagagccaga aggtggggca gctggcggcc aggggcatct gcgccaacta 2520 cctcaagctg acctactgca acgcctgctc ggccgactgc agcgccctct ccttcgtcct 2580 gcatcacttc cccaagcggc tggccctaga cctagacaac aacaatctca acgactacgg 2640 cgtgcgggag ctgcagccct gcttcagccg cctcactgtt ctcagactca gcgtaaacca 2700 gatcactgac ggtggggtaa aggtgctaag cgaagagctg accaaataca aaattgtgac 2760 ctatttggga ctttggaaat cagtagacac catatgcttc aaaaaacagg ggctattaaa 2820 atgacatcag gagccagaaa gtctcatggc tgtgctttct cttgaagttt atacaacaac 2880 cagatcaccg atgtcggagc cagactggga aaaaacaaaa taacaagtga aggagggaag 2940 tatctcgccc tggctgtgaa gaacagcaaa tcaatctctg aggttgggat gtggggcaat 3000 caagttgggg atgaaggagc aaaagccttc gcagaggccc tgcggaacca ccccagcttg 3060 accaccctga gtcttgcgtc caacggcatc tccacagaag gaggaaagag ccttgcgagg 3120 gccctgcagc agaacacgtc tctagaaata ctgtggctga cccaaaatga actcaacgat 3180 gaagtggcag agagtttggc agaaatgttg aaagtcaacc agacgttaaa gcatttatgg 3240 cttatccaga atcagatcac agctaagggg actgcccagc tggcagatgc gttacagagc 3300 aacactggca taacagagat ttgaacttgt ttggaacttg tcataaaatc gatcagtttg 3360 gtgaattgca accaacaata tttaaaaaga aaacagaaca gaacaaaata tcaggatgca 3420 atgtgcatgc ctaaatggaa acctgataaa accagaggag gccaaagtct atgaagatga 3480 gaagcggatt atctgtttct gagaggatgc tttcctgttc atggggtttt tgccctggag 3540 cctcagcagc aaatgccact ctgggcagtc ttttgtgtca gtgtcttaaa ggggcctgcg 3600 caggcgggac tatcaggagt ccactgcctc catgatgcaa gccagcttcc tgtgcagaag 3660 gtctggtcgg caaactccct aagtacccgc tacaattctg cagaaaaaga atgtgtcttg 3720 cgagctgttg tagttacagt aaatacactg tgaagagact ttattgccta ttataattat 3780 ttttatctga agctagagga ataaagctgt gagcaaacag aggaggccag cctcacctca 3840 ttccaacacc tgccataggg accaacggga gcgagttggt caccgctctt ttcattgaag 3900 agttgaggat gtggcacaaa gttggtgcca agcttcttga ataaaacgtg tttgatggat 3960 tagtattata cctgaaatat tttcttcctt ctcagcactt tcccatgtat tgatactggt 4020 cccacttcac agctggagac accggagtat gtgcagtgtg ggatttgact cctccaaggt 4080 tttgtggaaa gttaatgtca aggaaaggat gcaccacggg cttttaattt taatcctgga 4140 gtctcactgt ctgctggcaa agatagagaa tgccctcagc tcttagctgg tctaagaatg 4200 acgatgcctt caaaatgctg cttccactca gggcttctcc tctgctaggc taccctcctc 4260 tagaaggctg agtaccatgg gctacagtgt ctggccttgg gaagaagtga ttctgtccct 4320 ccaaagaaat agggcatggc ttgcccctgt ggccctggca tccaaatggc tgcttttgtc 4380 tcccttacct cgtgaagagg ggaagtctct tcctgcctcc caagcagctg aagggtgact 4440 aaacgggcgc caagactcag gggatcggct gggaactggg ccagcagagc atgttggaca 4500 ccccccacca tggtgggctt gtggtggctg ctccatgagg gtgggggtga tactactaga 4560 tcacttgtcc tcttgccagc tcatttgtta ataaaatact gaaaacactc 4610 12 260 DNA Homo sapiens misc_feature 38 n = A,T,C or G 12 ctcatt gtc tcc cgc cca cat tca att act gaa aag cna tcg gga act 48 Val Ser Arg Pro His Ser Ile Thr Glu Lys Xaa Ser Gly Thr 1 5 10 tct ggt cac tca cat ccg caa tac tca gtg tct ggt gga caa ctt gct 96 Ser Gly His Ser His Pro Gln Tyr Ser Val Ser Gly Gly Gln Leu Ala 15 20 25 30 gaa gaa tga cta ctt ctc ggc gga aga tgc gga gat tgt gtg tgc ctg 144 Glu Glu Leu Leu Leu Gly Gly Arg Cys Gly Asp Cys Val Cys Leu 35 40 45 ccc cac cca gcc tga ca ggtgccccgg ggacagggac gggcatgggc 191 Pro His Pro Ala ttgtgtggac accgggagct agaagagcct ctcctgctgg tctgagtgaa gagctgggag 251 ttacgtccg 260 13 13 000 14 248 DNA Homo sapiens misc_feature 8 n = A,T,C or G 14 cagggtantg gacagtgcaa ggaaagaaag aagctgcggt gaaagtgaca ctgagtgatt 60 gtaagtcagg agactttcct tcggtttctg cctttgatgg caagaggtgg agattgtggc 120 ggcgattaca gaaaacatct gggaagacaa gttgctgttt ttatgggaat cgcaggcttg 180 gaagagacag aagcaattcc agaaataaat tggaaattga agatttaaac aatgttgttt 240 taaaatat 248 15 34001 DNA Homo sapiens unsure (425)...(524) unknown 15 gacttcatgt ctaaaacacc aaaagcaatg gcaacaaaag ccaaaattga caaatgggat 60 ctaattaaac taaagagctt ctcacagcaa aagaaactac catcagagtg aacaggcaac 120 ctacagaatg gggggaaaaa atttgcaatc tactcatctg acaaagggct aatatccaga 180 atctacaagg aactgaaaca aatttacagg aaaaaaacaa acaaccccat caaaaagtgg 240 gcgaaggata tgaacagaca cttctcaaaa gaagatattt atgcagccaa cagtcacatg 300 aaaaagtgct catcaccact ggccatcaga gaaatgcaaa tcaaaaccac aatgagatac 360 catctcacac tagttagaat ggcaatcatt aaaaagtcag gaaacaacta ggtgctggat 420 gtagnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngatcat cgttcactgc 540 agccttgaac tcttgtgctc atgtgatcct cctgccttag cctccccaat agctgggact 600 acaggtgcgc caccatgcct ggctaatttt ttttattttt gtagagatgg gtgtctcact 660 atgttgcaca ggttggtctc aaactactgg ccttacttca agctatctac ccatctcagc 720 ctcccaaagc gctgggatta cagtcatgag ccaacttgcc tggccagata aaggtcttaa 780 gcatggttcc ttcctgctct aggtagagaa accccacaac cagtgggagg tggggtgagc 840 tctttctgta gcttttgctt tgctgatgat gtcattgatc tcttcagggg ctgcgcagag 900 tagcaggggc cctggagggc gcggcctgaa tcctgattgc ccttctgctg agaggacaca 960 cgcagctgaa gatgaatttg ggaaaagtag ccgcttgcta ctttaactat ggaagagcag 1020 ggccacagtg agatggaaat aatcccatca gagtctcacc cccacattca attactgaaa 1080 agcaatcggg aacttctggt cactcacatc cgcaatactc agtgtctggt ggacaacttg 1140 ctgaagaatg actacttctc ggccgaagat gcggagattg tgtgtgcctg ccccacccag 1200 cctgacaagg tgccccgggg acagggacgg gcatggcatt gtgtggaccc cgggagctag 1260 aagaggcctc tccctgctga tctgagtgaa gagcgtggga gtttagtcca gcgggcaggg 1320 ctgcattttg gggtactaat agcacacaaa tgcctgggtt agcaggttgc acagtcaggt 1380 attttacttc tgtgtttgtg tctggagcaa accctgacat ctcagttctc attgctgtgt 1440 gtattggttc ccagacactt catttttaga tcccctttaa attaggaggg aaaaagaaca 1500 taagcataag agcatcccca gcagcgatgt tcattcagtg cctctgaagg ctggagggct 1560 gcttgttgct gggtgagact cggaggggaa ccgactcagg gtcaggaatg atgacatccc 1620 acggtgggtc cacagtgaag aatcttcccc gctccactgt gggacgcctt aacagccctt 1680 acttccactt acgctttgcg ttatctcctg aaaaataaaa tggagaccac aaattccttc 1740 ttggttagag gaatgacaca actcatttat gacatgaccc cgctgggact cagaagagac 1800 caggacggtt tctgggggaa gcagtagcac actcgtgtgc tttgttctct tctcttgatt 1860 tgttttccca catttttaac aagaaaaaaa gccgttttta atatatggcc tatcgccctc 1920 ctactgtgtg gcccaggtgc ctacctcatt atgcccaagg ggtggttctc acctctccac 1980 tctcattcct gcacagcagt tgtgtcaggt taagagggac aaggagaagg ctgggcaccg 2040 tggctcacgc ctgtaatccc agcactttgg gaggccgagg caggcagatc acctaaggtc 2100 aggagtttga gaccagcctg gccaacatgg ggaaaacccg tctctaataa aaacacaaaa 2160 attagtcggg catggtggtg ggtgcctgta atcccagcca cttgggaggc tgaggaaaga 2220 gaattccttg aacctgggag gtggaggttg cagtgagcca agattgtgcc attgcactcc 2280 agccctccag cctgggtgac agagcaagac tctgtctcaa aaaagaaaaa aaaaaaaaag 2340 aggtagagaa gtccatggct atttgtctgt cctttttatt tttaggctca tggaagcctc 2400 ctggtttctt agagctgagt ggttttattt cttgctcagg aggtcatttc acagattttc 2460 gggctccaat atgttgactg tcacagcagc tggggggatg gcatagctac cggctgtact 2520 aagaactcag agccctgccc tgagcctgcc tgagggtcct tatggtagga ggatgcccct 2580 catgccagcc cgtgccctca tgcttgtgtc acctccaggt ccgcaaaatt ctggacctgg 2640 tacagagcaa gggcgaggag gtgtccgagt tcttcctcta cttgctccag caactcgcag 2700 atgcctacgt ggacctcagg ccttggctgc tggagatcgg cttctcccct tccctgctca 2760 ctcagagcaa agtcgtggtc aacactgacc caggtaggag tcagccccag caagaccgca 2820 ggcaccagtg caagcagggc cctggggggt ttggtaatgg ctgggccagc cctgagtgcc 2880 acctcaggaa gcaggcccag gtgctatttt gattttagaa aggaacagct gaatcctgtc 2940 tcccaagtgc agcccaggtg gctgcgattg aactgcccac acctcgatgg tctggtttat 3000 agaggggcct ttggaagtat gggaatggcc tgtgttctga ccccttgctt tcttcctatt 3060 ctgacatatg tagacatttt aatggttgca caaattcaag gttgtatttt tttttctttt 3120 aaaaaaatct ttagctggac atggtagcac acacctgtag ttccagctac tcaggaggct 3180 gaggcaagag gactgcttga gccccagagt ctaaggctgc agcgagctat gattgtgccc 3240 ctacactcca gcctgggtga cagagtgaga ccctgtctct aaaaaaggaa agaaaaaaat 3300 taaaaagcct tgccaggttt gattctaggc aaagtattct gtcaccgttg agtgccagtc 3360 cttatttcca aactaatgga agaccccatc agttaactga ttagttcaat aagtattttt 3420 tgctgtatcc accacatgcc aagaccctac actgtgctgg atgtcaggga gacagtggtg 3480 agcagacaca gacagggttc ctgccctcag ggagcttcaa gtcagctgga agagaccacc 3540 agtcagcaat ctcaaaaatg tgtcaggaca gcggcagtcc aaggcatgtg agaacatatc 3600 attagggcca ggatctgctc tggggcagga gtcttctttc cctgcttttg aactctccac 3660 tttgagacag ctgttggtaa cataccagca ccaaggacct aagtcctgcc ttttaaagaa 3720 tccaatatgt tgttggaaac agaagcacaa gacaggtgtg tgcttagggg aaacaaggcc 3780 agccggcaga gtgtcagtgc taggctccag cttccacagc ccctgcaggt gcctgccagc 3840 cactgctagc ttctgactct gtctgctcct tcctgtctcc ccttgtttcc ttcccccatg 3900 aaaaaaaaag aaagtattcc catgaggaat cattctttcg aaagacttct ctgttggttc 3960 cgttagccag ctactttact agcttttaca gtgtaattca ctctacaagc agtctcacac 4020 aaaagactac atattgtatg attctgttta tatgaaatgt ccagaaaagg taaatctata 4080 gacaaagcaa atcagtagtt gcctacggcc cagggattgg ctacaaatag gctccagaaa 4140 actctgggaa gatggtagag atgttctaga cctggactgt ggtgaggttt gcacaacttt 4200 gtaaacttac taaaaattac tgacaaatat ataacactcc ctaacacttt gggaggccga 4260 ggtgggcaga tcgcttgaac ccaggaattt gagaccagcc tgggcaacat ggcgagaccc 4320 cgtctctaca aaaaaacaca aaaattagtt gggcttggtg gcatatgcct gtgtcccagc 4380 tacttgggag gctgaggtgg gaggattgct tgagcctggg agtttgagac tgcatgattg 4440 ggtcactgca ccctagcctg agtgacagag caggacccta tctctaacaa caaaaaagca 4500 gtgttggtgg aggagggcca gcgtggccat ctggcctggc cctcgagtgc gaggggcttc 4560 agtgtttagc tgcagttcag tgatgacact gtgcggagga ataagggtgg cctgtctcag 4620 acactgatcc cagctgaagt ttgtcacctt ctttctggca aatctgaggt caagcagaga 4680 gatcaaagcc tggggccctc agggtcagga atgctggctc tgtgacgctc cccaggtcct 4740 gcatctgagg agtggctgcg ctggcctcag ggcccaggtt gtgaattttg tttatgcact 4800 cgcctctcct ctttgagacc tccctgtttg atgctgtttc tgcctctctc ctcaccctgc 4860 tgctgtgccc tgccaccccc tccctccagt gagcaggtat acccagcagc tgcgacacca 4920 tctgggccgt gactccaagt tcgtgctgtg ctatgcccag aaggaggagc tgctgctgga 4980 ggagatctac atggacacca tcatggagct ggttggcttc agcaatgaga gcctgggcag 5040 cctgaacagc ctggcctgcc tcctggacca caccaccggc atcctcaatg agcagggtga 5100 gaccatcttc atcctgggtg atgctggggt gggcaagtcc atgctgctac agcggctgca 5160 gagcctctgg gccacgggcc ggctagacgc aggggtcaaa ttcttcttcc actttcgctg 5220 ccgcatgttc agctgcttca aggaaagtga caggctgtgt ctgcaggacc tgctcttcaa 5280 gcactactgc tacccagagc gggaccccga ggaggtgttt gccttcctgc tgcgcttccc 5340 ccacgtggcc ctcttcacct tcgatggcct ggacgagctg cactcggact tggacctgag 5400 ccgcgtgcct gacagctcct gcccctggga gcctgcccac cccctggtct tgctggccaa 5460 cctgctcagt gggaagctgc tcaagggggc tagcaagctg ctcacagccc gcacaggcat 5520 cgaggtcccg cgccagttcc tgcggaagaa ggtgcttctc cggggcttct cccccagcca 5580 cctgcgcgcc tatgccagga ggatgttccc cgagcgggcc ctgcaggacc gcctgctgag 5640 ccagctggag gccaacccca acctctgcag cctgtgctct gtgcccctct tctgctggat 5700 catcttccgg tgcttccagc acttccgtgc tgcctttgaa ggctcaccac agctgcccga 5760 ctgcacgatg accctgacag atgtcttcct cctggtcact gaggtccatc tgaacaggat 5820 gcagcccagc agcctggtgc agcggaacac acgcagccca gtggagaccc tccacgccgg 5880 ccgggacact ctgtgctcgc tggggcaggt ggcccaccgg ggcatggaga agagcctctt 5940 tgtcttcacc caggaggagg tgcaggctcc gggctgcagg agagagacat gcagctgggc 6000 ttcctgcggg ctttgcggag ctgggccccg ggggtgacca gcagtcctat gagtttttcc 6060 acctcaccct ccaggccttc tttacagcct tcttcctcgt gctggacgac agggtgggca 6120 ctcaggagct gctcaggttc ttccaggagt ggatgccccc tgcgggggca gcgaccacgt 6180 cctgctatcc tcccttcctc ccgttccagt gcctgcaggg cagtggtccg gcgcgggaag 6240 acctcttcaa gaacaaggat cacttccagt tcaccaacct cttcctgtgc gggctgttgt 6300 ccaaagccaa acagaaactc ctgcggcatc tggtgcccgc ggcagccctg aggagaaagc 6360 gcaaggccct gtgggcacac ctgttttcca gcctgcgggg ctacctgaag agcctgcccc 6420 gcgttcaggt cgaaagcttc aaccaggtgc aggccatgcc cacgttcatc tggatgctgc 6480 gctgcatcta cgagacacag agccagaagg tggggcagct ggcggccagg ggcatctgcg 6540 ccaactacct caagctgacc tactgcaacg cctgctcggc cgactgcagc gccctctcct 6600 tcgtcctgca tcacttcccc aagcggctgg ccctagacct agacaacaac aatctcaacg 6660 actacggcgt gcgggagctg cagccctgct tcagccgcct cactgttctc aggtgaggct 6720 gccaggcaag gggagcaaca ggtgggccgg gcgggccagg ctcggagggc atcgggaatg 6780 gcatcatgga ccaggatccc ccaggactca tgaccatggc ccttggaatg tccagacctt 6840 ttctttctta gcagggcaga ggtcaaggtg caaagcttcg aggcaggtgg acctggatca 6900 gccacagctg ggtgcccttg aacaaagtgc ttaactctca gagcctccac gccctcatct 6960 ggaaaaagaa gatgctcata atcctatcaa ttatggccac agggaccaat gttagttgag 7020 aatgggtgaa gtgcattaca aatattacct aatggaatgc tctttacaac cctgtaactt 7080 aggtactgtt attgtctcta ttttggcaga taaggaagta gaggcacaga gaagttaata 7140 gcttgcttta ggtcacacag ctcagacata gcagtgccag aatgcataaa gaaccttcct 7200 tttaagatta atgtaaggct ccgagatagc cctcaaaaag tttctggaat atgggagctt 7260 ttattactgc agagaaagca gaccttgtgc cagttggcac tggtgacttt ctgtgatcaa 7320 cgctagcagc ccttcacact gctagagacc tcagttaaaa tgctgactcg tggttgtttt 7380 cctgttccat agtttacggg aaacagagcc cagtctgttt tcttctatta gcatttccta 7440 tgtaaaataa accttgtaaa tctctacagg gggttaaatt tgccattact tgactcatgc 7500 atttctaaaa agcagtaggg atttggaact gactcccagt gcctgtcaca ccagtgtcag 7560 agtgtaaata attgcatggg gacatggggt gcagggggtc gaaggctgcc ctagcctggg 7620 aattggaaaa cctggagtct gttctctgta ctctcagcca gtgactctcc ctctgtagcc 7680 ccaggcagtc tcacactcag tgccaccctc tgtccatctt ttttttttct cccccaaatg 7740 gagtcccgct ctgttgccca ggctggagtg cagtggcgtg atctcagctc actgcaacct 7800 ccgcctcctg ggttcaagcg attctcttgc cccagcctcc tgagtagctg ggattacagg 7860 cacacgccac catgtccggc taagtttttt gtatttttag taggacgggg tttccccatg 7920 ttggccaggc tggtcttgaa atcctgacct caggtgatcc gcccgcctcg gccttccaaa 7980 atgttggggt tacaggcatg agccgccgca cccgacccct ctgtccatct tttcaatggg 8040 aaactccaca ccagtgtggt ggccctgccc ttcctgctgt ccccaggtga agctttcctt 8100 cacaccagtg caagaaaaaa cagcttgtag gaaagcagag gatatgggta accacgggaa 8160 gcacactcag ttctctggct gcatcagtta ggattagttt tagctgagag cgaaaacccc 8220 aaatgttggt gagttacaag cttatttctc tcatgtaaaa gtctagaggt aggtagttca 8280 ggactggtat ggagtctcca tgaccctccg gagcccaggc tctcttctgc cttcctgttc 8340 tgccatcctc actacccggc tttcccatct tggcccaaga gggctgctca aactccagcc 8400 atctagtcga cactctagct atcagtaaga aggaagggca aagattgaga gcatgcctca 8460 atcttttaag aacacttctt ggctattact aattatattg ctgcttagat ttcagaactt 8520 aatggtatgg gcagaattta atgagatggg cccagctaaa agatggggga atctattgct 8580 aagaaagtat agatattggg aatgtctagc agcctgtgct gtcttgggct ggccatgcca 8640 tgtacataca cactatttcc cagcaccaag ctggggactc tgagggaaag ggtccagagt 8700 gtctgacttg atcattttga tgtggcctaa aaatcaagct tttaattgtt cagcctttta 8760 cttgttatca aggtcagctt gtgggtctaa ttgggcccaa ggcttgtgtt tctaagtaaa 8820 gttttattgg aacgcagcca tacccattta tttacttact ggctgcttca cactacacag 8880 ttgagtagct gtgacagaga ccacatggcc cacagagcct aaaatatttg ctgtctgaca 8940 ctttacagaa tgacatgagc agtctccttt gacagtggga ctcacagcct tttccagtga 9000 caaatcaggg ttagcccatg tgtttctgga tggggggaag ctgttggcat tttgggtata 9060 acagttcttg tgagacctgt ccagcatttt gcaggacacc taacatcatt ggccctgcct 9120 gcaagatgac agggcactcc ctcctccagt cacaaccact aaaagcagcc cctgacattt 9180 ccaaacccat gccctccacc atacgagaac caggtacagg gtctggctga cacataggtc 9240 acacgcaaag ggtggatgtc agaggtggct ggcctcacac gtcctccctg tgtccttcac 9300 ggtcgtgtga ggagccaggg gctgtgctgc agcctcgctc atgggctggt gcaggatggg 9360 tctggcggcc ccacgttggc caggctttgt aaggggctat ttggctgatt gctgtggcca 9420 ttctccaggg gcgtctatac ctgagaaaac tccagggcct gaaggcttct ggatctttgt 9480 aagattaatg gtccttcata atgagtgcct gccctgactc gtaatttttt tgctgtttta 9540 tttcagactc agcgtaaacc agatcactga cggtggggta aaggtgctaa gcgaagagct 9600 gaccaaatac aaaattgtga cctatttggg gtatgtcttt ctccagaaca ctgggccaac 9660 tacctagtaa taatacagag ctgcagggaa ttcacattcc cataggtccc tggatgatcg 9720 gcacggatgg cccagggctg ggaagagcgc tggcccagga gttgagagtc ctgggttctc 9780 tttgtggctc ggccagtcat gaagtcttgc tgagcctcag cctcctcacc tgtaaaactg 9840 ggatcccagt ataggcaagt aggcttacaa ctggttattg ggggatgcaa cgagaatata 9900 aggggatata tttaataaat gctagaatcc tgtttacata ttagtctgga ctattttggg 9960 tccataatcc ctcatccaga gcctttgggg caagacccga atggggattc tgagtgcatg 10020 ctatggcatg acgtggccgc aggggtctaa ggcagtgccc cattttcaaa cactttcata 10080 tttctcccgc agaatgtatg aaacagtcaa accaagtgtg gtaagaaaga ctataagtag 10140 ctccacatca gttgccaaaa gaattgtgag aaactttggg cattcagagc ctttgaggtt 10200 ttggagtctg agagaaggga ttgcgggcca gccccacaca actggtggct ctgcaagctg 10260 gagcagttgt tcagtttctt ggggcctcag tggccttcga tgttaatgag gacatggacg 10320 caaacgaccc cgggccacac tcggctccag ggctctgtgt ggctgtggaa ccctggaagc 10380 ctgagcttag ctgcctttca acttccatct gctgtactat tgaattggca ttgagcggtg 10440 agatggctga aaggtagaca tcgagaagtt ttaatattca gaatcttttc ttctcaagac 10500 gctgaatgta atcttagttg taaataccca tcacctgcca gtcaccgagc actcatgcac 10560 cagggctttg cgttatgtcc taagatcctc ataaccaccc tgcaagggga ctatcatcat 10620 tacctctgta ttacagatgg agaaactgag gcacagagag gtaacgtgac ttgtctcagg 10680 ccataaagct ggggaaagta gtggagctgg ttttgaacct gagctgtgag acctcagagc 10740 cctaaactct ggtgcctgtg tgttcccctt tcaacccaga ctttggaaat cagtagacac 10800 catatgcttc aaaaaacagg ggctattaaa atgacatcag gagccagaaa gtctcatggc 10860 tgtgctttct cttgaagttt atacaacaac cagatcaccg atgtcggagc caggtacgtc 10920 accaaaatcc tggatgaatg caaaggcctc acgcatctta agtaagtggg gtaggcacca 10980 ggttccttag tatattctct tgatcacccc cttctgttgt tcaaagatta aatgtcacag 11040 taaagagctt tcatcctaaa gccttccact tgtcccaggg ccatgttggt caagtaaaga 11100 tacctctgtg tgatctgtga ggcttggatt ctggaagggc ctcccgttat tggtaggggg 11160 aaaggttggc attttgattt cattaactac taggccgaag aaaggactaa ctctcaccct 11220 ttctggtggt ctttttgccc caagggagtt tcctgtcggg ttgcaaggaa gagcttgggc 11280 ccttgccctg ctgtaggtgt gccctgcgca gggggtgaca gtgcgccagg cttggagcct 11340 ctggtcctgc cctgacagtg gccacatacc ttgacccttg gcagtcaaag tgggacctcc 11400 caggtctccc gagggaagtc agtgatgctg ctgaggtcaa ttagaggacc ccagggaggg 11460 ctcaggtccc tgagcttctg cagagactgt ggaccatctc ctggagagga accctgactg 11520 actgtcctca gggcttcagt tccctccctg acaggaggcc caggccatgg ctcttgtgga 11580 tcccagaaga aagtgtacgg ttcccaagat ggggctggaa ggggctctgt gctggggagg 11640 agggtgaccc acattggagc ccctgcatag ctggaggctg actgtgtgtg actctctctg 11700 cagactggga aaaaacaaaa taacaagtga aggagggaag tatctcgccc tggctgtgaa 11760 gaacagcaaa tcaatctctg aggttgggtg agtagaaggg gatggatgta tgtggtacaa 11820 cctgctgtgt gtgtgggggg cgggccttgc tgttcttttc atacatcagt acaccagaag 11880 gaccactggg gctcgctgtc ggggagagat agtggagagc tttcaccatg ctgcgaaact 11940 gaaaccgtgc ccattaagca ataactcccc ggtccccctc ccccctgcct cttgcagcca 12000 ccctgctact tactctctct atggttttga ctactctacc tcatgtaagt ggaatcatac 12060 agtatttgcc ttttggggat ggctgatttc actagcatca tgtcctcaag attcgtccac 12120 atggaagcat gggacaggat ttcctttttt tttttttttt tttttttttt tgacagagtc 12180 tcgctctgtt gcccaggctg gagtgcagtg gcatgatctc ggctcactgc aacctctgcc 12240 ttctgggttc aagcgattct ctcgcctcag ccacacgagt agctgggatt ataggcaccc 12300 gccaccaatc ccagctaatt tttgtatttt tagtagaggc ggggtttcac catgttggcc 12360 aggctggtct caaactcctg acctcaaatg atccacccac ctcggtctcc caaagtgtca 12420 ggattatagg cgtgagccac cgtgccccgc caggatttcc ttctttttta aggctgagta 12480 atactccatt gcatggctat gccacatttt gtttactcat tcatccaaga acagacactg 12540 gcttgcttct atgctttggc tgttgtgaat aatgctgctg tgcacatggg catacaaatg 12600 tctcttcaag gactgccttc aattcttttt tttttttttt ttttttttta gattcttttt 12660 ttttttatta tactctaagt tttagggtac atgtgcacat tgtgcaggtt agttacatat 12720 gtatacatgt gccatgctgg tgcgctgcac ccactaatgt gtcatctagc attaggtata 12780 tctcccaatg ctatccctcc cccctccccc gaccccacca cagtccccag agtgtgatat 12840 tccccttcct gtgtccatgt gatctcattg ttcaattccc acctatgagt gagaatatgc 12900 ggtgtttggt tttttgttct tgcgatagtt tactgagaat gatggtttcc aatttcatcc 12960 atgtccctac aaaggatatg aactcatcat tttttatggc tgcatagtat tccatggtgt 13020 atatgtgcca cattttctta atccagtcta tcattgttgg acatttgggt tggttccaag 13080 tctttgctat tgtgaatagt gccacaataa acatacgtgt gcatgtgtct ttatagcagc 13140 atgatttata ctcatttggg tatataccca gtaatgggat ggctgggtca aatggtattt 13200 ctagttctag atccctgagg aatcgccaca ctgacttcca caatggttga actagtttac 13260 agtcccacca acagtgtaaa agtgttccta tttctccgca tcctctccag cacctgctgt 13320 ttcctgactt tttaatgatt gccattctaa ctggtgtgag atgatatctc atagtggttt 13380 tgatttgcat ttctctgatg gccagtgatg atgagcattt cttcatgtgt tttttggctg 13440 cataaatgtc ttcttttgag aagtgtctgt tcatgtcctt cgcccacttt ttgatggggt 13500 tgtttgtttt tttcttgtaa atttgtttga gttcattgta gattctggat attagccctt 13560 tgtcagatga gtaggttgcg aaaattttct cccatgttgt aggttgcctg ttcactctga 13620 tggtagtttc ttttgctgtg cagaagctct ttagtttaat tagatcccat ttgtcaattt 13680 tgtcttttgt tgccattgct tttggtgttt tggacatgaa gtccttgccc acgcctatgt 13740 cctgaatggt aatgcctagg ttttcttcta gggtttttat ggttttaggt ttaacgttta 13800 aatctttaat ccatcttgaa ttgatttttg tataaggtgt aaggaaggga tccagtttca 13860 gctttctaca tatggctagc cagttttccc agcaccattt attaaatagg gaatcctttc 13920 cccattgctt gtttttctca ggtttgtcaa agatcagata gttgtagata tgcggcatta 13980 tttctgaggg ctctgttctg ttccattgat ctatatctct gttttggtac cagtaccatg 14040 ctgttttggt tactgtagcc ttgtagtata gtttgaagtc aggtagtgtg atgcctccag 14100 ctttgttctt ttggcttagg attgacttgg cgatgcgggc tcttttttgg ttccatatga 14160 actttaaagt agttttttcc aattctgtga agaaagtcat tggtagcttg atggggatgg 14220 cattgaatct gtaaattacc ttgggcagta tggccatttt cacgatattg attcttccta 14280 cccatgagca tggaatgttc ttccatttgt ttgtgtcctc ttttatttcc ttgagcagtg 14340 gtttgtagtt ctccttgaag aggtccttca catcccttgt aagttggatt cctaggtatt 14400 ttattctctt tgaagcaatt gtgaatggga gttcacccat gatttggctc tctgtttgtc 14460 tgttgttggt gtataagaat gcttgtgatt tttgtacatt gattttgtat cctgagactt 14520 tgctgaagtt gcttatcagc ttaaggagat tttgggctga gacgatgggg ttttctagat 14580 aaacaatcat gtcgtctgca aacagggaca atttgacttc ctcttttcct aattgaatac 14640 cctttatttc cttctcctgc ctgattgccc tggccagaac ttccaacact atgttgaata 14700 ggagcggtga gagagggcat ccctgtcttg tgccagtttt caaagggaat gcttccagtt 14760 tttgcccatt cagtatgata ttggctgtgg gtttgtcata gatagctctt attattttga 14820 aatacgtccc atcaatacct aatttattga gagtttttag catgaagggt tgttgaattt 14880 tgtcaaaggc tttttctgca tctattgaga taatcatgtg gtttttgtct ttggctctgt 14940 ttatatgctg gattacattt attgatttgc gtatattgaa ccagccttgc atcccaggga 15000 tgaagcccac ttgatcatgg tggataagct ttttgatgtg ctgctggatt cggtttgcca 15060 gtattttatt gaggattttt gcatcaatgt tcatcaagga tattggtcta aaattctctt 15120 ttttggttgt gtctctgccc ggctttggta tcagaatgat gctggcctca taaaatgagt 15180 tagggaggat tccctctttt tctattgatt ggaatagttt cagaaggaat ggtaccagtt 15240 cctccttgta cctctggtag aattcggctg tgaatccatc tggtcctgga ctctttttgg 15300 ttggtaaact attgattatt gccacaattt cagagcctgt tattggtcta ttcagagatt 15360 caacttcttc ctggtttagt cttgggagag tgtatgtgtc gaggaatgta tccatttctt 15420 ctagattttc tagtttattt gcgtagaggt gtttgtagta ttctctgatg gtagtttgta 15480 tttctgtggg atcggtggtg atatcccctt tatcattttt tattgtgtct atttgattct 15540 tctctctttt tttctttatt agtcttgcta gcggtctatc aattttgttg atcctttcga 15600 aaaaccagct cctggattca ttgatttttt gaagggtttt ttgtgtctct atttccttca 15660 gttctgctct gattttagtt atttcttgcc ttctgctagc ttttgaatgt gtttgctctt 15720 gcttttctag ttcttttaat tgtgatgtta gggtgtcaat tttggatctt tcctgctttc 15780 tcttgtaggc atttagtgct ataaatttcc ctctacacac tgctttgaat gcgtcccaga 15840 gattctggta tgtggtgtct ttgttctcgt tggtttcaaa gaacatcttt atttctgcct 15900 tcatttcgtt atgtacccag tagtcattca ggagcaggtt gttcagtttc catgtagttg 15960 agcggctttg agtgagattc ttaatcctga gttctagttt gattgcactg tggtctgaga 16020 gatagtttgt tataatttct gttcttttac atttgctgag gagagcttta cttccaacta 16080 tgtggtcaat tttggaatag gtgtggtgtg gtgctgaaaa aaatgtatat tctgttgatt 16140 tggggtggag agttctgtag atgtctatta ggtctgcttg gtgcagagct gagttcaatt 16200 cctgggtatc cttgttgact ttctgtctca ttgatctgtc taatgttgac agtggggtgt 16260 taaagtctcc cattattaat gtgtgggagt ctaagtctct ttgtaggtca ctgaggactt 16320 gctttatgaa tctgggtgct cctgtattgg gtgcataaat atttaggata gttagctcct 16380 cttgttgaat tgatcccttt accattatgt aatggccttc tttgtctctt ttgatctttg 16440 ttggtttaaa gtctgtttta tcagagacta ggattgcaac ccctgccttt ttttgttttc 16500 cattggcttg gtagatcttc ctccatcctt ttattttgag cctatgtgtg tctctgcacg 16560 tgagatgggt ttcctgaata cagcacactg atgggtcttg actctttatc caacttgcca 16620 gtctgtgtct tttaattgca gaatttagtc catttatatt taaagttaat attgttatgt 16680 gtgaatttga tcctgtcatt atgatgttag ctggcgattt tgctcattag ttgatgcagt 16740 ttcttcctag tctcgatggt ctttacattt tggcatgatt ttgcagcggc tggtaccggt 16800 tgttcctttc catgtttacc gcttccttca ggagctcttt tagggcaggc ctggtggtga 16860 caaaatctct cagcatttgc ttgtctataa agtattttat ttctccttca cttatgaagc 16920 ttagtttggc tggatatgaa attctgggtt gaaaattctt ttctttaaga atgttgaata 16980 ttggccccca ctctcttctg gcttgtaggg tttctgccga gagatccgct gttagtctga 17040 tgggctttcc tttgagggta acccgaactt tctctctggc tgcccttaac attttttcct 17100 tcatttcaac tttggtgaat ctgacaatta tgtgtcttgg agttgctctt ctcgaggagt 17160 atctttgtgg cgttctctgt atttcctgaa tctgaacgtt ggcctgcctt gctagattgg 17220 ggaagttctc ctggataata tcctgcagag tgttttccaa cttggttcca ttctccacat 17280 cactttcagg tacaccaatc agacgtagat ttggtctttt cacatagtcc catatttctt 17340 ggaggctttg ctcatttctt tttattcttt tttctctaaa cttcccttct cgcttcattt 17400 cattcatttc atcttccatt gctgataccc tttcttccag ttgatcgcat cggctcctga 17460 ggcttctgca ttcttcacgt agttctcgag ccttggtttt cagctccatc agctccttta 17520 agcacttctc tgtattggtt attctagtta tacattcttc taaatttttt tcaaagtttt 17580 caacttcttt gcctttggtt tgaatgtcct cccgtagctc agagtaattt gatcgtctga 17640 agccttcttc tctcagctcg tcaaaatcat tctccatcca gctttgttct gttgctggtg 17700 aggaactgcg ttcctttgga ggaggagagg cgctctgcgt tttagagttt ccagtttttc 17760 tgttctgttt tttccccatc tttgtggttt tatctacttt tggtctttga tgatggtgat 17820 gtacagatgg gttttcagtg tagatgtcct ttctggttgt tagttttcct tctaacagac 17880 aggaccctca gctgcaggtc tgttggaata ccctgccgtg tgaggtgtca gtgtgcctct 17940 gctggggggt gcctcccagt taggctgctc gggggtcagg ggtcagggac ccacttgagg 18000 aggcagtctg cccgttctca gatctccagc tgcgtgctgg gagaaccact gctctcttca 18060 aagctgtcag acagggacac ttaagtctgc agaggttact gctgtctttt tgtttgtctg 18120 tgccctgccc ccagaggtgg agcctacaga ggcaggcagg cctccttgag ctgtggtggg 18180 ctccacccag ttcgagcttc ccggctgctt tgtttaccta agcaagcctg ggctatggcg 18240 ggcgcccctc ccccagcctc gttgccgcct tgcagtttga tctcagactg ctgtgctagc 18300 aatcagcgag attccgtggg cgtaggaccc tctgagccag gtgtgggata tagtctcgtg 18360 gtgcgccgtt tcttaagccg gtctgaaaag cgcaatattc gggtgggagt gacccgattt 18420 tccaggtgcg tccgtcaccc ctttctttga ctcggaaagg gaactccctg atcccttgcg 18480 cttcccaggt gaggcaatgc ctcgccctgc ttcggctcgc gcacggtgcg cgcacacact 18540 ggcctgcgcc cactgtctgg cgctccctag tgagatgaac ccggtacctc agatggaaat 18600 gcagaaatca cccgtcttct gcgtcgctca cgctgggagc tgtagaccgg agctgttcct 18660 attcggccat cttggctcct ccctccaatt cttttgggta tatatccagc agtgggattg 18720 ctggatcaca tggtaatttt taattttttg aagaatcatc atactgtttt ccacggcagc 18780 agcaccattt tatgttccca ccaacagttc attctagttt ctccacatcc ttgccaacac 18840 ttgctatttt ctctttttga cagtacccat cctaatgagt gtgaggtcct gtctcattgt 18900 ggttttgatt cttgaggctt tttaaagctt tttgtttcat tataattttt attggattac 18960 aaaaggaaca caggtaattt tatttggaaa ctatgaaaaa taataaaaat tatcttctca 19020 gaaaatgatt cttgttaaca tttaagctca gttaagctct ctcactttct ctcccttctc 19080 tctctttgta caacttttaa aaaatatagt aggggtgaga ctatatgtat ctatactata 19140 gtaggggtga gactatatgt atccttcctt tttcacttaa tctcatgcct tgagtagctt 19200 tccactttat taaaaatgtg atgccattca attgtatagt aaatacatat atgtaagcaa 19260 aacactgaaa actcttattc tgggttccag caagccatac ctggaatggt gtaagcaggt 19320 agtttgcttg gtgtgaacgt gttgttgagg cagctgccat tgtgttgtga gtgggccaca 19380 cgaacttgtt ctgttgtgtg tagacagtgt gtgctgatcc tattaggaac agccaacgct 19440 ttgtgtgagc cacacacggt tctaagtgct ttgcttctgt taactcagtg aatcctcaca 19500 actccatgac ggaatgctct aattatcccc attttataga tggggcaact gaggtccaag 19560 agactacata atttcccgaa gttcacacag gtagcagatg gcagagccgg gtcaggagtc 19620 caccatctta ccacgcagac tgttttagcc agagactctc cggatctgct gtaggggaca 19680 gaatacagct ttatcgccgc acctgtccac caagatggcc gtagccacag agcttggttg 19740 ggtaacgtcc tctttatgtg acaggaacgt tgctgatggg gtttctgaag gtacttcctg 19800 ctctttgtct cctggaagac tgtgtcttca ggaatgtctc tgaccctgcc cagagttgaa 19860 cggatgctgg gaacccagca cctgcacacg gccttccctc caggactctg cgcacctctg 19920 tgctccacag gagacatgca ggtgctttct ctcatgagct caggctcctg ggctgacagc 19980 tctccgaagc tcgtggtgag gctcggtctc taactgtgcc acttgccgat ggcctctgtt 20040 cacaaggctt cccctgctct tcgatcttgc atcacccctt gaatttgaaa tccagagcag 20100 cccactcaga gaccagtgtg aggaattagt gtccaggcca cagatccagg gactgggcac 20160 aaacatctgc ctgttgagta ggaactgagc tgtggccatt ggcaaaaaag gaggggtgag 20220 catggctgtt tcttggggag ctaacattca ctatcttgtc tcctccctca ggatgtgggg 20280 caatcaagtt ggggatgaag gagcaaaagc cttcgcagag gctctgcgga accaccccag 20340 cttgaccacc ctgaggtaac tgtggccctg ctgtctccag gggccaacct ggtccctccc 20400 agctgctcta ggtttgctgg ggaagggtga ttcgtgctcc taatagaaga ggaatttgca 20460 tgtgtgattt tccttactct tgtcaaacct ttctttgatg cataagaggc catctagtaa 20520 agcacattct tctctttttt taactttaag ttctgggata catgtagaag atgtgcaggt 20580 ttgttacata ggcaaatgca tgccatggtg atttgctgca cctatcaacc tgtcatctag 20640 gttttaagcc ctgcatgcat taggtatttg tcctaatgct tgccctcccc ttgcccccca 20700 cccccaacag gccctggtgt gtgttgttcc cctccatgtg tccatgtgtt ctcattgttc 20760 aactcccact tacgagtgag aacatgcagt gtttggtttt ttgttcctgt gttagtttgc 20820 tgagaatgat ggtttccagc ttcatccatg tgccagcaaa ggacatgatc tcattttttt 20880 ttatggttgc atagtattcc atagtgtgta tgtgccacat tttctttatc cagtctatca 20940 ctgatgggca tttgggttgg ttccaagtct ttgctattgt aaatagtgct acaataaaca 21000 tacatgtgct tgtgtcttta tagcagaatg atttataatc ctttgggtaa atacccagta 21060 atgggattgc tgggtcaaat ggtatttctg gttctagatc cctgaggaat caccttaagt 21120 gtttattcag ctcagtgaat tctgcatgtg tcccacacca gccaaccacc acccccatca 21180 agacagagga catttccagc ccctcagcca tccctgcatg tcccttgctg gtagagggag 21240 ggtttcctaa gtgcagatga aacttaataa gatgctggcc agcagattcc tgccccttcc 21300 ttgtcctcag gatgatgctg gaaaagaggg actcttcctc tctataaatg gggatgcacc 21360 tacccagccc ccgcttaggc tgctggccaa atcttgggac cttggtatgt ccacggctct 21420 gctgctgttc ttcctaccac tgaaaaagag tccaagaagg tggggacagt agcagaagag 21480 actttgccag gtcttgcaga tggggtacct tgatggggcc agcctttaga aggacagctt 21540 gccaggcctc gccagcctcc tgcccatgtg cagaaacctg aggtgccgac cccagcccac 21600 tgttgtgtga gcaggctgtg ctgatgaccc atttcccgtc cagcctgccc ttgtgctctg 21660 tgtgtgggct ctggggcagc agcgcctggg cactactgct gcagctgaac acttctgcat 21720 cctgccccga gtgagcctgg gctggggcca cagccaggca gaggcttccc agctgttctg 21780 atgttgaagc taagattgaa tgtagatgtg tctttaataa ttcaccccaa gtgtgttcct 21840 tcctagtctt gcgtccaacg gcatctccac agaaggagga aagagccttg cgagggccct 21900 gcagcagaac acgtctctag aaatactgtg gtaatagctc gagtcatttc atttgtttgt 21960 ttgtttttct gtgatagggt cttgctttgt cgtccaggct tgagtgcatt ggtgtgatct 22020 cagctcactg cagcctccac ctcccaggct cattcgaacc tcccgccttg gccttccgag 22080 tcctgagact ataggcatgc accaccacac ccagttaatt ttaaaatttt ttgtagagat 22140 ggggttttgc tatgttaccc aggctggtct tgaactcctg ggctcaagca gttctcctgc 22200 cctggcttct caaagccctg ggattgcagg tgtgagccac tgcacctggc acagagtcat 22260 tttggagggt ttaggtccca ggaattatcc caggggctgc acatggcctg gaatcttaac 22320 agaaaaggtg tctcccaatt ggaaaggctc taggcctttc agttaagttg ataatttcct 22380 cctagagaag agaatagcca cttctacaag cataaacagg tacaggagga ggaagtgggc 22440 tccgggagcc tggatctgag gccttggcct tctaggcccc aggagaacta gaacgctggc 22500 catgcaagct atccaggtat ccttggatac cttcagatgt gcttagcaga ggccaacttc 22560 cacacacttg gctcaaaatt ttctcccttc ctcctcttca tctgccttcc cccaggcagc 22620 ctcctccttc cccaggtctt cacatcaggg tttggccttt atgctccatc cagctcatct 22680 gtcacttgtc acctgaagcc cacagtcctc gctccctctc tgcactctag ggcacttact 22740 aagtggatgt ggcctcctga gagtgttttt tgttggtgtt ccctttttta tggccactta 22800 atgttttatt ttgctttatt tgtatttaca tctctgtatc ataaattcca tacaggtggc 22860 tgggagcagt gactcacatc tgtaatccca gtactttgga aggctgaggt gggaggatcg 22920 cttgaggcca agagttcgag actagcctgg gcaatatagc gagaccctct atctacaaaa 22980 aaaaaaaaca ttccttacag gttaagtgag ggagttgtat tacaaccctc cctatcatct 23040 actcagagcc cagtgctcat ttgatcttgc taaattagtt actgagaata atgacaatat 23100 cctcttcatg agagagtttt gacattaggc ctgctgtcca gtaagtgcat tttaaattct 23160 ttcccctcaa caaatcattt aacattttga aaagtagttt atgttttttg gaaaaaatgt 23220 aagacactaa aggaggacat gaaagtacct cctaaagttc ctgctaaaag gaggaagtga 23280 aagtacctcc ctttgtgttt tccaaaataa cctttccttt ctagcctttt gttctatgta 23340 tgttcaaaga tatgcaaaac agaatagcat tcaagcagtg gctctaaaaa tattgtaatc 23400 acatacttta catgtctcct ttagggtttc tccatcttga tgctgttgac attttggtcc 23460 aagtgattct ttattatggt agggctgtcc tgtgcatcat agacggttta gccgcatctc 23520 tgccctgtac ctcccagtgg tgaggatcaa aaatgcctcc ggacatggcc aggtgcccca 23580 tggagagtga aatcacatgg atagtagtaa tgtcaacacc tagaagccct caagtgctga 23640 ctgcatgcca tgtgttattc tacacttttt ccctgtgtta actcactcag cctcacaacc 23700 actctatacg atctctactg ttaacgttca ccagtgagaa aactcagacc caaagaactt 23760 aagcctgttg cccgaggtca ccctgctggt gggtgataca aacctgccca ggctgagtcc 23820 ggagtagatg tcaatgctgt gttcttctcc ctcctcattc tacctcattc tccctacaag 23880 ctgcacaaca tctcgaatag atatcacaat atatttcatc agttgtttct gatctaaatt 23940 tgttcagatt ttacattagg ataataccac aatgcatgct gcaatgtata aagctttgtg 24000 tgtatatcct tgcacactgt agggtaaatt tctagaagtc tgattgtctt aaaatgaagc 24060 acattaaaaa tttgggcagg cacatccaaa ctgcccttca aggaattttt ttttttaaat 24120 gttctttctg ttctattctt cttcctaatg attctttcgt ccactggcac aagtgggtcc 24180 taccctgttt acaccaagga gctttggtgc tttatccaga ccacttctgg ttctaaggac 24240 cattgagaga cttcctgaac tttcagtcac ttaacttggg tccctcacaa gttaactgag 24300 agcaaagtac tgaacacatt ttaatgtgca gtcagtgact gtttcaggtc ttcaaactaa 24360 cttggataac acactgtcag tggtgttcaa gggaccctgg gactagagga gaactgagaa 24420 gcaggcattg gccctttgtt ttccgtgggc ccccatcttc catgaaatct gagggctcag 24480 caaaggtggg gagggagggt gggctcctct acaggtagct gggctaagaa ataggagccc 24540 aggtacagga tttgcattaa aaatgagtcc cattgacctt ctgtggggct gacaggctgg 24600 gcttggagcc tggctgtttt ctgggttctc agcaagtgat catctgcata gctggagagc 24660 cttgggctga gctcccgctc ctgtgaactc taaaacaatg tctgccaagt aggctctctt 24720 gagtaaatac ttcctttttt ttccttaggc tgacccaaaa tgaactcaac gatgaagtgg 24780 cagagagttt ggcagaaatg ttgaaagtca accagacgtt aaagcattta tggtaactca 24840 gagagcctta caatttcaga ctgtgctact tttcaaaagt attttttgag ataaaattta 24900 catactgtaa aattcactct cttaaagtat acaattcaga ggtttttagt gcaaccatca 24960 ccacctaatt ctagaacatt ttcactcctc ctccccactc caaaaagccc tggtatccat 25020 taagcagtca ctccctgtcc tcctccccag accctggcaa ccactaatcc gctttctgtc 25080 tctatggatt tgcctactct gggcatttca tataaatgga atcaagcaat atgtgacctt 25140 ttgtctctgt gttctagcat gtttcattcc tttttatggc taaatgataa ttcactctaa 25200 ggaaattttg cagtttatta atcagttgat gggacatttg ggttgtttct actttttgac 25260 tattatgcgt aatgctactg tgaacactcc tgttcatgct tttgggtgaa catatgtttt 25320 catctctttt gggaatatac ctgggaatag aatttctggg tcatatggca attctgtaac 25380 tttttgagga gccaccaaac tgttttctaa agtggatgta ctattttaca ttctcgccag 25440 caatgtatgt ggattccaat ttctccacat cctcaccaac acttattatt gtccatcttt 25500 taaaatctag ttatactagt ggatgtgaag taatattgtg gttttgattt gcatttccct 25560 gatgacaaca atgttgaatg tctttttatg tgcctactgg gagtctgtat agcttctttg 25620 gagaaatgtc tccatatcct ttgcccattt taaaattggg tttgtcttct aatgctgagt 25680 tataggggtt ctctatatat tctgggtgct agacctttac tagatacagg ttttgcaagt 25740 attttctttc tttctgtgga gtttttcctc tttcttgata gtgaccttta aaggacaaca 25800 gtttttaatt tttgtttttt ttgagatgga gtcttgctct tgtcacccag acaggagtgc 25860 agtggcatga tctcagctct ctgcaacctc cacctcctgg gttcaagcga ttcttctgcc 25920 tcagcctcct gagtagttgg gattacaggc atcagccacc atgcctgtct cattttgtat 25980 ttttaataga gatggggttt caccatttag gcccaggctg gtcttgaact cctgacctca 26040 ggtgatccac ctgcctcagc ctcccaaagt gctgggatta caggcgaaaa gccactgcac 26100 ctggccaata gtttttaatt ttgatgaagt ccaatttatc tatttttttc tttggttgct 26160 tgtgctttca gtgtcttatc taagaaatga ttgcctaatc caagatcaca aagaactcca 26220 cctaagtttt ctgttaagcg ttatagttgt ttcccctcac atataggtct gcaatccatt 26280 ttgagttaat ttttgtatag tgtaaagtga gggttaacct cattctcttg cacgtggata 26340 tccagctgtc ccggcagcac cacgtgttga acagattatc ttttcctatt gaatggcctt 26400 gacacccttg tcaaaaatca attgaccata aatgtatggg tttatttctg aattctctgt 26460 tctggtccat tgatttatat gtctctccta tgccaggacc attgctgtag ctttgtgtag 26520 tacattttga aatcaggagg tgtgagttct actttgttct tctttctcaa gattgtttag 26580 accattctgg gttctttgca tttcttatga attcagactc accttgtcaa tttctgcaaa 26640 aagactagac tctgctacat attgtttttt ctttcctttt tagcctgcag aattatttga 26700 tcccattccc taagtgcagg ccagcctctc cagggagagc agagctagga cagggtcaga 26760 aagagagtct tggctgcttt gtgcattccc aacctgcact ggccctagtg aaggcagccc 26820 gagtgggtgg atgtgcctgg acactgcagg ctttttaggg gcattaggtg ctctccttcc 26880 tggcctcctg ccacatcttg gttggaggct gccttccctg ccttcaaaaa agcctaagtg 26940 gtgactagaa aacagcagag tgtaactgaa tacagaactt ggtgcccact tcctggttct 27000 atttttgtcc cttttgaaag ggaaggtcat tacctctgcc attgaaccca ggggccctag 27060 cccttgtggg gtatggctgg gagcaccaga tcctggctgc agcccagcca ccagtggtcc 27120 tgtgtgcttg ggcagtaaca gtgacaagag ctcccttccc cctggacact gtgcctaata 27180 ccctcctctt gaaatctcac acacccagtg gatggggggc actcttatag ttattctcag 27240 tttacagatg acacaactga ggcacagaca gatgcgttta tttcttcaag gttctgtagc 27300 tgaacagtgg ggagggaggg tttaagagga gctgcacccg ctctgcaata ctgcctctca 27360 cgagggagtc ctcttcattc atgacagcat agggccctcg tcttcctggt aagggcttcc 27420 ttcttgggtc agtgccagga tttctaaggg tcatgtttag caggagccta ttctacaaac 27480 agccaggagc agggaatgac tctgtgatga agcggagaca ctacagcctc ttgatgcatt 27540 tatttcctgg ttgggttaga agcgtagctg cccaagggag catttcagga gaggcctggc 27600 ttcctagcga tagctgaaaa ctttgtttca tttgaatcac tgctacccag aacaatgggg 27660 tgcattctca gagtccccat tattaaagct tttccactga gccccatgag aactattcat 27720 gagaactatt tcatggcagc ataactgttt ctcctccctc cctcttgcat gttggtagcc 27780 tcttaacttt aaaacctgcc ttgcctttcc ctagctacct ggaaggagac gtcagacttc 27840 ctgtcccatg gtgtgtttct tacaatttgt tgttcagatt ggtggtctcc caaatatata 27900 taaaaatata aatggagtct cactctgtca cccaggctgg agtgcagtgg cacgatcttg 27960 gctcactgca acctccacct cccagttcaa gcaattctcc tacctcagtc tcccgagtag 28020 ctgggagtac aggtgcacac caccatgccc agctaatttt tttgtatttt taatagagac 28080 aggttggcca ggatggtatc gatctcctga gctcgtgatc cacccacctc ggcctcccaa 28140 agtgctggga ttacaggtgt gagccactgc acccggcccc aaatattttg attatgcacc 28200 tctgcagtga aaaatgcaaa cacacacatc agttcatgta ttacattatg ttcactataa 28260 aaacaaacag aaaatttaaa aaatatcaag ctatccttta ctctagtgga tcttacctgg 28320 acacttttag ccagatacaa agtcacatgg actcagttct tcccctgacc aacttgtctc 28380 ttatcccaaa acacccttgc aactccctta cgaaggggtc aaatttgatc cagtattatg 28440 gattttatac aagttatgtt cttctttcag gcttatccag aatcagatca cagctaaggg 28500 gactgcccag ctggcagatg cgttacagag caacactggc ataacagaga tttggtaaga 28560 tcccagcgtt tgtcacagta ataacaccag tgactgttta ctcaccacca ctgactgtgc 28620 aaggcacaac gcagggtggt ttctgtttat tcctccagca accctgcaca gtaatggtat 28680 tacctctgtt ttacagaggt agacagaggc ccagaccagt gaaataaggt tgcccaaggt 28740 cactacgaga gaagctagaa ttcagcccag aatgcctgat tccatattct gtgctctcct 28800 gccctgggcc cccgccctca tctaccttca ttgggtggga tgggggaagt ggccagtgaa 28860 atgatttcct agtggaagta aatccccctg ggactcagca attgagagat gactgtgttg 28920 gccaggagtt tggagctcat tcttcccctt ttctgggttc cgtaagacat ttccaggctg 28980 acttgaactg acctgtgctc tttgtctact tcttttttct gctttgagaa cttccttatg 29040 ctaatagaag aaaaaaagtt tgctttactg tgacattgag cgccatgcca cttctttctt 29100 gcctcccata aggcacagac actccccact cagcagctcc cttaacaact taattgcctg 29160 ggtgacgtgg gactgggtgg atgctgggag aggggcctta ttaactatgt cctcctttca 29220 tgactgggga gaatttcata gccaattaaa aaaaaacaaa aaacagctcc ttggccaaca 29280 caggctcctc atacagtgtt ttttaaactt tgctttagaa cttgtttgga acttgtcata 29340 aaatcgatca gtttggtgaa ttgcaaccaa caatatttaa aaagaaaaca gaacagaaca 29400 aaatatcagg atgcaatgtg catggtatga aagtatcatt tcattcatct tagttcatgc 29460 ttgcatgtga gtgggtgtgt gtttgcataa gtgttggttc acaacataaa atgtaattct 29520 tatttagggt tgtagacaaa aggttttttt ttaaaaaaaa cactgttggc taggcatggt 29580 ggctcatgcc tgtaatacca gcactttggg aggccaagat gggcagatct cttgagcaca 29640 ggagtttgag accagcctgg gcaacatgcg aaaccccgtc actacaaaaa ttagcccgac 29700 atggtgctat gtgcctgtag tcccagctac tcaggaagct gatgtgggag gatggatgca 29760 tgggagatca aggctgcagt gagccaggat catgccactg caatccagcc tgggtgccag 29820 agaccctgtc tcaaaaaaca aaaaagaaaa aaagaaaaac accatcatag agaatagagc 29880 ccagatctaa acagacacct gtggcctgtg tgcctgcgaa gcccagcctg cccagcagcc 29940 tgggaagcac tggagggcac tggaactgtt tgcatgggtg tttgccctca ggccactccg 30000 tttctgctga ttcttaagtt ttgaggacag caggcagagg gggagaggaa ggagactgcc 30060 agactacaga acagtttgca gagcacagtt ggcttccact tttctctgta gctggtcagg 30120 cgggtagtaa agacctacag ttgctttaat tctgtcaagt ttcaaaatct gcattgcttc 30180 cctcttgagg gtcaccattc ctacacaagg aaccatttta gtagggccag gagacttcag 30240 cttcaaggcc tgcacttgtg tcagggtgga gaggggaact ggccaccaat tcagagaggg 30300 caggacaggc ggcatgggtg ctggtcttgg gagtgtcttc acttaggtcc ctggcttgtt 30360 ctgggagcct ccagagcatg ctcctctgtg tgtgacttca tgggactggg ctctgagaag 30420 gctgtggctt tgttggccct gccagggact gccacaccag gccacagggt tgtggttgag 30480 ctggccgggg agccacgttc agggagcagc tctgcttgga gccaacactt acagagtaag 30540 ccttctcctt ggacttgtta actgtactga cacttatttc tacctcattc ctttctgaaa 30600 ataacttgga agtctgaagt cccttgatga gttctgtctt taagaacaga aattagaggt 30660 gaacaatgaa cactgtaaat tacagaaatg tatcccactc cagtataaca gctttctgtg 30720 aggctatctc ctccagactg tggctctggg agggtggggc ctgagtcaag gtcctaggga 30780 ctagtgctgt gtcttcattt attccttgaa taacgaaacg cttgagcatc agggactgtg 30840 ctagcaccaa aaatccagtg gtgaacaaca tggcttcatg ggttcactgt ctagaaaggg 30900 agaagcacat taaagaaaaa atcatttgcg taattattta attacaactg tgatgggtac 30960 tatcacaaag gggaaggcca agagggaacc tgatttagat gaggttgcag ggaaggcctc 31020 tctgaggaag cagcacttac actaagccat gaaggatgaa taggagctag tcagctgagg 31080 tgagtattct gcgtagggaa cagcatgtgc aaagggtctg gggcaggagg gagtgtggtg 31140 tcctggaaga actgccagaa gctgctgtgc cccagggttc agacagtgtg gaagagggga 31200 ctacaggagg ctgaggagat aggcagggac tggaccataa aagatctgtg ggtcatgatg 31260 tgcattttgg tctttatcct aaaagtgatg gaaagtcagt gaacagtttg aagcaggaga 31320 ggcatgtgat cagatctgca atgcaaaaag accaattctt ggctcttcta ggaaactgaa 31380 ttggagaagg ccagagtacg tggaaatgac ctgtcagtag gacattgtac tgatgcaggg 31440 aagagatgat gggtgctcag accaagatgg ccggccaaag acatagaggt tccagggagg 31500 cattctagat tcttaggaat taggggagaa ctttgtgata caaggaacat ggggatgaga 31560 aggaaggtgt ccaggttgac ccccaggtta ctaacctgct cagcaggatg agagtggtcc 31620 attcactaag ccaggggacc ctaggaggtg tggctacttt gaggtgtggg ggagaggtcc 31680 aagtgaggat gccaagcagg taactgcctc cacggacata caaacaaggc cgtggcattg 31740 atgagatcgg gtggggaaaa gggcttagcc ccaaacctgg aggaaatctc agatgtagag 31800 gtcacatgga ggagaatata ggaaaggaaa ttgaagtaga gtgctcagat gcaggagaaa 31860 aatcagcgca tataaccaag ccaaggggag ggagtgcctc aagaaggagg gagaggagag 31920 gtcaggacag ccaaaatcct gagggccaag aaagacaaga cctggaaaat gtcattaaat 31980 tcaggcttat ggaggctaca ggtgacctta gtgagaccca gtgaacagag ggatggcagc 32040 tggagaggat ccatgctaat atgaaggaac tatctgcaaa gggtatgttc cttaatttca 32100 gggatacatg tgtattgtgt gatacacgag tgtgtgctat gaacacacct tgggaaggag 32160 tgtgcgagga tccttaacat tttacctgtg tacttttgtc ttcctccttt tcaacagcct 32220 aaatggaaac ctgataaaac cagaggaggc caaagtctat gaagatgaga agcggattat 32280 ctgtttctga gaggatgctt tcctgttcat ggggtttttg ccctggagcc tcagcagcaa 32340 atgccactct gggcagtctt ttgtgtcagt gtcttaaagg ggcctgcgca ggcgggacta 32400 tcaggagtcc actgcctcca tgatgcaagc cagcttcctg tgcagaaggt ctggtcggca 32460 aactccctaa gtacccgcta caattctgca gaaaaagaat gtgtcttgcg agctgttgta 32520 gttacagtaa atacactgtg aagagacttt attgcctatt ataattattt ttatctgaag 32580 ctagaggaat aaagctgtga gcaaacagag gaggccagcc tcacctcatt ccaacacctg 32640 ccatagggac caacgggagc gagttggtca ccgctctttt cattgaagag ttgaggatgt 32700 ggcacaaagt tggtgccaag cttcttgaat aaaacgtgtt tgatggatta gtattatacc 32760 tgaaatattt tcttccttct cagcactttc ccatgtattg atactggtcc cacttcacag 32820 ctggagacac cggagtatgt gcagtgtggg atttgactcc tccaaggttt tgtggaaagt 32880 taatgtcaag gaaaggatgc accacgggct tttaatttta atcctggagt ctcactgtct 32940 gctggcaaag atagagaatg ccctcagctc ttagctggtc taagaatgac gatgccttca 33000 aaatgctgct tccactcagg gcttctcctc tgctaggcta ccctcctcta gaaggctgag 33060 taccatgggc tacagtgtct ggccttggga agaagtgatt ctgtccctcc aaagaaatag 33120 ggcatggctt gcccctgtgg ccctggcatc caaatggctg cttttgtctc ccttacctcg 33180 tgaagagggg aagtctcttc ctgcctccca agcagctgaa gggtgactaa acgggcgcca 33240 agactcaggg gatcggctgg gaactgggcc agcagagcat gttggacacc ccccaccatg 33300 gtgggcttgt ggtggctgct ccatgagggt gggggtgata ctactagatc acttgtcctc 33360 ttgccagctc atttgttaat aaaatactga aaacactctt acgggttgag tctggagttt 33420 ttgaagggac ttggcttggt aagcactcat tgactcctga gccccatcct gattcactcc 33480 acagtgggga aggggctctg gggtgatgtg ctatgaggag agcctgatga aggccagggg 33540 tgtcaccagt ttgatccttc acaggcctct ctgcctacca agggacagga agcggctgtg 33600 gcagcctctg aggtctctcc atctggcctc tgaatctctt caggtggctt ctcagaggaa 33660 ataacttgtg agtagggggt ggctggtgcc aggacaggcc aagtgggcca aagttcatgc 33720 cttcatcacc atgccatggt agagcccacg ggccaggttc gacgtccact accttcctcg 33780 gctgttcact gctgagtggc ggatccaggt aggcccatgg caagaagcac cgagctgcca 33840 ggggcagcac gtgacagagg aaggcatgca gggcctccaa cggtccacct ctgagttctt 33900 atgagtccaa gcctggcttt gtagagcagc ctgttaggaa ggggaccgtt gcgggggaaa 33960 tcctgtacag ttaagcaact acaaggcggc agttccttaa a 34001 16 20 DNA Artificial Sequence Antisense Oligonucleotide 16 cctgacttac aatcacttgg 20 17 20 DNA Artificial Sequence Antisense Oligonucleotide 17 agcaacttgt cttcccagac 20 18 20 DNA Artificial Sequence Antisense Oligonucleotide 18 aattgcttct gtctcttcca 20 19 20 DNA Artificial Sequence Antisense Oligonucleotide 19 aacattgttt aaatcttcaa 20 20 20 DNA Artificial Sequence Antisense Oligonucleotide 20 gtcctctcag cagaagggca 20 21 20 DNA Artificial Sequence Antisense Oligonucleotide 21 ctgctcttcc atagttaaag 20 22 20 DNA Artificial Sequence Antisense Oligonucleotide 22 tctgatggga ttatttccat 20 23 20 DNA Artificial Sequence Antisense Oligonucleotide 23 ccagaagttc ccgattgctt 20 24 20 DNA Artificial Sequence Antisense Oligonucleotide 24 ttttgcggac cttgtcaggc 20 25 20 DNA Artificial Sequence Antisense Oligonucleotide 25 tgctgggtat acctgctcac 20 26 20 DNA Artificial Sequence Antisense Oligonucleotide 26 tcattgagga tgccggtggt 20 27 20 DNA Artificial Sequence Antisense Oligonucleotide 27 tcacccagga tgaagatggt 20 28 20 DNA Artificial Sequence Antisense Oligonucleotide 28 cagctcgtcc aggccatcga 20 29 20 DNA Artificial Sequence Antisense Oligonucleotide 29 tcaggtccaa gtccgagtgc 20 30 20 DNA Artificial Sequence Antisense Oligonucleotide 30 tagccccctt gagcagcttc 20 31 20 DNA Artificial Sequence Antisense Oligonucleotide 31 ctcgatgcct gtgcgggctg 20 32 20 DNA Artificial Sequence Antisense Oligonucleotide 32 tccagctggc tcagcaggcg 20 33 20 DNA Artificial Sequence Antisense Oligonucleotide 33 cggaagtgct ggaagcaccg 20 34 20 DNA Artificial Sequence Antisense Oligonucleotide 34 catcctgttc agatggacct 20 35 20 DNA Artificial Sequence Antisense Oligonucleotide 35 caccaggctg ctgggctgca 20 36 20 DNA Artificial Sequence Antisense Oligonucleotide 36 cacagagtgt cccggccggc 20 37 20 DNA Artificial Sequence Antisense Oligonucleotide 37 gccccagcga gcacagagtg 20 38 20 DNA Artificial Sequence Antisense Oligonucleotide 38 cctgcacctc ctcctgggtg 20 39 20 DNA Artificial Sequence Antisense Oligonucleotide 39 tgtcgtccag cacgaggaag 20 40 20 DNA Artificial Sequence Antisense Oligonucleotide 40 gtggtcgctg cccccgcagg 20 41 20 DNA Artificial Sequence Antisense Oligonucleotide 41 gcaggacgtg gtcgctgccc 20 42 20 DNA Artificial Sequence Antisense Oligonucleotide 42 tggaagtgat ccttgttctt 20 43 20 DNA Artificial Sequence Antisense Oligonucleotide 43 ccgcaggagt ttctgtttgg 20 44 20 DNA Artificial Sequence Antisense Oligonucleotide 44 gccttgcgct ttctcctcag 20 45 20 DNA Artificial Sequence Antisense Oligonucleotide 45 cccgcaggct ggaaaacagg 20 46 20 DNA Artificial Sequence Antisense Oligonucleotide 46 ttgcagtagg tcagcttgag 20 47 20 DNA Artificial Sequence Antisense Oligonucleotide 47 gggctgcagc tcccgcacgc 20 48 20 DNA Artificial Sequence Antisense Oligonucleotide 48 aacagtgagg cggctgaagc 20 49 20 DNA Artificial Sequence Antisense Oligonucleotide 49 tctggttgtt gtataaaccc 20 50 20 DNA Artificial Sequence Antisense Oligonucleotide 50 tttgctgttc ttcacagcca 20 51 20 DNA Artificial Sequence Antisense Oligonucleotide 51 ggttccgcag agcctctgcg 20 52 20 DNA Artificial Sequence Antisense Oligonucleotide 52 ggacgcaaga ctcagggtgg 20 53 20 DNA Artificial Sequence Antisense Oligonucleotide 53 ccctcgcaag gctctttcct 20 54 20 DNA Artificial Sequence Antisense Oligonucleotide 54 ctggataagc cataaatgct 20 55 20 DNA Artificial Sequence Antisense Oligonucleotide 55 aaatctctgt tatgccagtg 20 56 20 DNA Artificial Sequence Antisense Oligonucleotide 56 tccatttagg caaatctctg 20 57 20 DNA Artificial Sequence Antisense Oligonucleotide 57 ctcctctggt tttatcaggt 20 58 20 DNA Artificial Sequence Antisense Oligonucleotide 58 agcatcctct cagaaacaga 20 59 20 DNA Artificial Sequence Antisense Oligonucleotide 59 cagggcaaaa accccatgaa 20 60 20 DNA Artificial Sequence Antisense Oligonucleotide 60 gcctgcgcag gcccctttaa 20 61 20 DNA Artificial Sequence Antisense Oligonucleotide 61 gtctcttcac agtgtattta 20 62 20 DNA Artificial Sequence Antisense Oligonucleotide 62 gcctcctctg tttgctcaca 20 63 20 DNA Artificial Sequence Antisense Oligonucleotide 63 atgaggtgag gctggcctcc 20 64 20 DNA Artificial Sequence Antisense Oligonucleotide 64 caactttgtg ccacatcctc 20 65 20 DNA Artificial Sequence Antisense Oligonucleotide 65 gggaccagta tcaatacatg 20 66 20 DNA Artificial Sequence Antisense Oligonucleotide 66 tgcatccttt ccttgacatt 20 67 20 DNA Artificial Sequence Antisense Oligonucleotide 67 gccagcagac agtgagactc 20 68 20 DNA Artificial Sequence Antisense Oligonucleotide 68 agagctgagg gcattctcta 20 69 20 DNA Artificial Sequence Antisense Oligonucleotide 69 gaagcagcat tttgaaggca 20 70 20 DNA Artificial Sequence Antisense Oligonucleotide 70 tactcagcct tctagaggag 20 71 20 DNA Artificial Sequence Antisense Oligonucleotide 71 cagctgcttg ggaggcagga 20 72 20 DNA Artificial Sequence Antisense Oligonucleotide 72 gctggcccag ttcccagccg 20 73 20 DNA Artificial Sequence Antisense Oligonucleotide 73 ttaacaaatg agcgggcaag 20 74 20 DNA Artificial Sequence Antisense Oligonucleotide 74 tcagtatttt attaacaaat 20 75 20 DNA Artificial Sequence Antisense Oligonucleotide 75 agatacacac tcactcagtg 20 76 20 DNA Artificial Sequence Antisense Oligonucleotide 76 tttccaaagt cccaaatagg 20 77 20 DNA Artificial Sequence Antisense Oligonucleotide 77 ttttcccagt ctggctccga 20 78 20 DNA Artificial Sequence Antisense Oligonucleotide 78 tcccagctct tcactcagac 20 79 20 DNA Artificial Sequence Antisense Oligonucleotide 79 ggctccaagc ccagcctgtc 20 80 20 DNA Artificial Sequence Antisense Oligonucleotide 80 tcagcccaag gctctccagc 20 81 20 DNA Artificial Sequence Antisense Oligonucleotide 81 actcaagaga gcctacttgg 20 82 20 DNA Artificial Sequence Antisense Oligonucleotide 82 ttacaatcac tcagtgtcac 20 83 20 DNA Artificial Sequence Antisense Oligonucleotide 83 aagtctcctg acttacaatc 20 84 20 DNA Artificial Sequence Antisense Oligonucleotide 84 tacgctgagt ctgaaataaa 20 85 20 DNA Artificial Sequence Antisense Oligonucleotide 85 tttccaaagt ctgggttgaa 20 86 20 DNA Artificial Sequence Antisense Oligonucleotide 86 ccaggatttt ggtgacgtac 20 87 20 DNA Artificial Sequence Antisense Oligonucleotide 87 ggacgcaaga ctaggaagga 20 88 20 DNA Artificial Sequence Antisense Oligonucleotide 88 cgagctatta ccacagtatt 20 89 20 DNA Artificial Sequence Antisense Oligonucleotide 89 ctgaaggtat ccaaggatac 20 90 20 DNA Artificial Sequence Antisense Oligonucleotide 90 actttcatac catgcacatt 20 91 20 DNA Artificial Sequence Antisense Oligonucleotide 91 ccaacactta tgcaaacaca 20 92 20 DNA Artificial Sequence Antisense Oligonucleotide 92 caaactgttc actgactttc 20 93 20 DNA Artificial Sequence Antisense Oligonucleotide 93 tccatttagg ctgttgaaaa 20 94 3080 DNA Homo sapiens 94 cacgcgtccg acttgctgaa gaatgactac ttctcggccg aagatgcgga gattgtgtgt 60 gcctgcccca cccagcctga caaggtccgc aaaattctgg acctggtaca gagcaagggc 120 gaggaggtgt ccgagttctt cctctacttg ctccagcaac tcgcagatgc ctacgtggac 180 ctcaggcctt ggctgctgga gatcggcttc tccccttccc tgctcactca gagcaaagtc 240 gtggtcaaca ctgacccagt gagcaggtat acccagcagc tgcgacacca tctgggccgt 300 gactccaagt tcgtgctgtg ctatgcccag aaggaggagc tgctgctgga ggagatctac 360 atggacacca tcatggagct ggttggcttc agcaatgaga gcctgggcag cctgaacagc 420 ctggcctgcc tcctggacca caccaccggc atcctcaatg agcagggtga gaccatcttc 480 atcctgggtg atgctggggt gggcaagtcc atgctgctac agcggctgca gagcctctgg 540 gccacgggcc ggctagacgc aggggtcaaa ttcttcttcc actttcgctg ccgcatgttc 600 agctgcttca aggaaagtga caggctgtgt ctgcaggacc tgctcttcaa gcactactgc 660 tacccagagc gggaccccga ggaggtgttt gccttcctgc tgcgcttccc ccacgtggcc 720 ctcttcacct tcgatggcct ggacgagctg cactcggact tggacctgag ccgcgtgcct 780 gacagctcct gcccctggga gcctgcccac cccctggtct tgctggccaa cctgctcagt 840 gggaagctgc tcaagggggc tagcaagctg ctcacagccc gcacaggcat cgaggtcccg 900 cgccagttcc tgcggaagaa ggtgcttctc cggggcttct cccccagcca cctgcgcgcc 960 tatgccagga ggatgttccc cgagcgggcc ctgcaggacc gcctgctgag ccagctggag 1020 gccaacccca acctctgcag cctgtgctct gtgcccctct tctgctggat catcttccgg 1080 tgcttccagc acttccgtgc tgcctttgaa ggctcaccac agctgcccga ctgcacgatg 1140 accctgacag atgtcttcct cctggtcact gaggtccatc tgaacaggat gcagcccagc 1200 agcctggtgc agcggaacac acgcagccca gtggagaccc tccacgccgg ccgggacact 1260 ctgtgctcgc tggggcaggt ggcccaccgg ggcatggaga agagcctctt tgtcttcacc 1320 caggaggagg tgcaggcctc cgggctgcag gagagagaca tgcagctggg cttcctgcgg 1380 gctttgccgg agctgggccc cgggggtgac cagcagtcct atgagttttt ccacctcagc 1440 ctcctcacct gtaaaactgg gatcccagta tagactttgg aaatcagtag acaccatatg 1500 cttcaaaaaa caggggctat taaaatgaca tcaggagcca gaaagtctca tggctgtgct 1560 ttctcttgaa gtttatacaa caaccagatc accgatgtcg gagccagact gggaaaaaac 1620 aaaataacaa gtgaaggagg gaagtatctc gccctggctg tgaagaacag caaatcaatc 1680 tctgaggttg ggatgtgggg caatcaagtt ggggatgaag gagcaaaagc cttcgcagag 1740 gctctgcgga accaccccag cttgaccacc ctgagtcttg cgtccaacgg catctccaca 1800 gaaggaggaa agagccttgc gagggccctg cagcagaaca cgtctctaga aatactgtgg 1860 ctgacccaaa atgaactcaa cgatgaagtg gcagagagtt tggcagaaat gttgaaagtc 1920 aaccagacgt taaagcattt atggcttatc cagaatcaga tcacagtctt ttgtgtcagt 1980 gtcttaaagg ggcctgcgca ggcgggacta tcaggagtcc actgcctcca tgatgcaagc 2040 cagcttcctg tgcagaaggt ctggtcggca aactccctaa gtacccgcta caattctgca 2100 gaaaaagaat gtgtcttgcg agctgttgta gttacagtaa atacactgtg aagagacttt 2160 attgcctatt ataattattt ttatctgaag ctagaggaat aaagctgtga gcaaacagag 2220 gaggccagcc tcacctcatt ccaacacctg ccatagggac caacgggagc gagttggtca 2280 ccgctctttt cattgaagag ttgaggatgt ggcacaaagt tggtgccaag cttcttgaat 2340 aaaacgtgtt tgatggatta gtattatacc tgaaatattt tcttccttct cagcactttc 2400 ccatgtattg atactggtcc cacttcacag ctggagacac cggagtatgt gcagtgtggg 2460 atttgactcc tccaaggttt tgtggaaagt taatgtcaag gaaaggatgc accacgggct 2520 tttaatttta atcctggagt ctcactgtct gctggcaaag atagagaatg ccctcagctc 2580 ttagctggtc taagaatgac gatgccttca aaatgctgct tccactcagg gcttctcctc 2640 tgctaggcta ccctcctcta gaaggctgag taccatgggc tacagtgtct ggccttggga 2700 agaagtgatt ctgtccctcc aaagaaatag ggcatggctt gcccctgtgg ccctggcatc 2760 caaatggctg cttttgtctc ccttacctcg tgaagagggg aagtctcttc ctgcctccca 2820 agcagctgaa gggtgactaa acgggcgcca agactcaggg gatcggctgg gaactgggcc 2880 agcagagcat gttggacacc ccccaccatg gtgggcttgt ggtggctgct ccatgagggt 2940 gggggtgata ctactagatc acttgtcctc ttgccagctc atttgttaat aaaatactga 3000 aaacacaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3060 aaaaaaaaaa aaaaaaaaaa 3080 95 4302 DNA Homo sapiens unsure 72 unknown 95 cccgcgtccg cgtccccgga ccatggcgct ctccgggctc ttctctagct ctcagcggct 60 gcgaagtctg tnaacctggt ggccaagtga ttgtaagtca ggagactttc cttcggtttc 120 tgcctttgat ggcaagaggt ggagattgtg gcggcgatta cagaaaacat ctgggaagac 180 aagttgctgt ttttatggga atcgcaggct tggaagagac agaagcaatt ccagaaataa 240 attggaaatt gaagatttaa acaatgttgt tttaaaatat tctaacttca aagaatgatg 300 ccagaaactt aaaaaggggc tgcgcagagt agcaggggcc ctggagggcg cggcctgaat 360 cctgattgcc cttctgctga gaggacacac gcagctgaag atgaatttgg gaaaagtagc 420 cgcttgctac tttaactatg gaagagcagg gccacagtga gatggaaata atcccatcag 480 agtctcaccc ccacattcaa ttactgaaaa gcaatcggga acttctggtc actcacatcc 540 gcaatactca gtgtctggtg gacaacttgc tgaagaatga ctacttctcg gccgaagatg 600 cggagattgt gtgtgcctgc cccacccagc ctgacaaggt ccgcaaaatt ctggacctgg 660 tacagagcaa gggcgaggag gtgtccgagt tcttcctcta cttgctccag caactcgcag 720 atgcctacgt ggacctcagg ccttggctgc tggagatcgg cttctcccct tccctgctca 780 ctcagagcaa agtcgtggtc aacactgacc cagtgagcag gtatacccag cagctgcgac 840 accatctggg ccgtgactcc aagttcgtgc tgtgctatgc ccagaaggag gagctgctgc 900 tggaggagat ctacatggac accatcatgg agctggttgg cttcagcaat gagagcctgg 960 gcagcctgaa cagcctggcc tgcctcctgg accacaccac cggcatcctc aatgagcagg 1020 ctgcttcaag gaaagtgaca ggctgtgtct gcaggacctg ctcttcaagc actactgcta 1080 cccagagcgg gaccccgagg aggtgtttgc cttcctgctg cgcttccccc acgtggccct 1140 cttcaccttc gatggcctgg acgagctgca ctcggacttg gacctgagcc gcgtgcctga 1200 cagctcctgc ccctgggagc ctgcccaccc cctggtcttg ctggccaacc tgctcagtgg 1260 gaagctgctc aagggggcta gcaagctgct cacagcccgc acaggcatcg aggtcccgcg 1320 ccagttcctg cggaagaagg tgcttctccg gggcttctcc cccagccacc tgcgcgccta 1380 tgccaggagg atgttccccg agcgggccct gcaggaccgc ctgctgagcc agctggaggc 1440 caaccccaac ctctgcagcc tgtgctctgt gcccctcttc tgctggatca tcttccggtg 1500 cttccagcac ttccgtgctg cctttgaagg ctcaccacag ctgcccgact gcacgatgac 1560 cctgacagat gtcttcctcc tggtcactga ggtccatctg aacaggatgc agcccagcag 1620 cctggtgcag cggaacacac gcagcccagt ggagaccctc cacgccggcc gggacactct 1680 gtgctcgctg gggcaggtgg cccaccgggg catggagaag agcctctttg tcttcaccca 1740 ggaggaggtg caggcctccg ggctgcagga gagagacatg cagctgggct tcctgcgggc 1800 tttgccggag ctgggccccg ggggtgacca gcagtcctat gagtttttcc acctcaccct 1860 ccaggccttc tttacagcct tcttcctcgt gctggacgac agggtgggca ctcaggagct 1920 gctcaggttc ttccaggagt ggatgccccc tgcgggggca gcgaccacgt cctgctatcc 1980 tcccttcctc ccgttccagt gcctgcaggg cagtggtccg gcgcgggaag acctcttcaa 2040 gaacaaggat cacttccagt tcaccaacct cttcctgtgc gggctgttgk ccaaagccaa 2100 acagaaactc ctgcggcatc tggtgcccgc ggcagccctg aggagaaagc gcaaggccct 2160 gtgggcacac ctgttttcca gcctgcgggg ctacctgaag agcctgcccc gcgttcaggt 2220 cgaaagcttc aaccaggtgc aggccatgcc cacgttcatc tggatgctgc gctgcatcta 2280 cgagacacag agccagaagg tggggcagct ggcggccagg ggcatctgcg ccaactacct 2340 caagctgacc tactgcaacg cctgctcggc cgactgcagc gccctctcct tcgtcctgca 2400 tcacttcccc aagcggctgg ccctagacct agacaacaac aatctcaacg actacggcgt 2460 gcgggagctg cagccctgct tcagccgcct cactgttctc agactcagcg taaaccagat 2520 cactgacggt ggggtaaagg tgctaagcga agagctgacc aaatacaaaa ttgtgaccta 2580 tttgggttta tacaacaacc agatcaccga tgtcggagcc aggtacgtca ccaaaatcct 2640 ggatgaatgc aaaggcctca cgcatcttaa actgggaaaa aacaaaataa caagtgaagg 2700 agggaagtat ctcgccctgg ctgtgaagaa cagcaaatca atctctgagg ttgggatgtg 2760 gggcaatcaa gttggggatg aaggagcaaa agccttcgca gaggctctgc ggaaccaccc 2820 cagcttgacc accctgagtc ttgcgtccaa cggcatctcc acagaaggag gaaagagcct 2880 tgcgagggcc ctgcagcaga acacgtctct agaaatactg tggctgaccc aaaatgaact 2940 caacgatgaa gtggcagaga gtttggcaga aatgttgaaa gtcaaccaga cgttaaagca 3000 tttatggctt atccagaatc asatcacagc twargggact gcccagctgg cagatgcgtt 3060 acagagcaac actggcataa cagagatttg cctaaatgga aacctgataa aaccagagga 3120 ggccaaagtc tatgaagatg agaagcggat tatctgtttc tgagaggatg ctttcctgtt 3180 catggggttt ttgccctgga gcctcagcag caaatgccac tytgggcagt cttttgtgtc 3240 agtgtcttaa aggggcctgc gcaggcggga ctatcaggag tccactgcct ccatgatgca 3300 agccagcttc ctgtgcagaa ggtctggtcg gcaaactccc taagtacccg ctacaattct 3360 gcagaaaaag aatgtgtctt gcgagctgtt gtagttacag taaatacact gtgaagagac 3420 tttattgcct attataatta tttttatctg aagctagagg aataaagctg tgagcaaaca 3480 gaggaggcca gcctcacctc attccaacac ctgccatagg gaccaacggg agcgagttgg 3540 tcaccgctct tttcattgaa gagttgagga tgtggcacaa agttggtgcc aagcttcttg 3600 aataaaacgt gtttgatgga ttagtattat acctgaaata ttttcttcct tctcagcact 3660 ttcccatgta ttgatactgg tcccacttca cagctggaga caccggagta tgtgcagtgt 3720 gggatttgac tcctccaagg ttttgtggaa agttaatgtc aaggaaagga tgcaccacgg 3780 gcttttaatt ttaatcctgg agtctcactg tctgctggca aagatagaga atgccctcag 3840 ctcttagctg gtctaagaat gacgatgcct tcaaaatgct gcttccactc agggcttctc 3900 ctctgctagg ctaccctcct ctagaaggct gagtaccatg ggctacagtg tctggccttg 3960 ggaagaagtg attctgtccc tccaaagaaa tagggcatgg cttgcccctg tggccctggc 4020 atccaaatgg ctgcttttgt ctcccttacc tcgtgaagag gggaagtctc ttcctgcctc 4080 ccaagcagct gaagggtgac taaacgggcg ccaagactca ggggatcggc tgggaactgg 4140 gccagcagag catgttggac accccccacc atggtgggct tgtggtggct gctccatgag 4200 ggtgggggtg atactactag atcacttgtc ctcttgccag ctcatttgtt aataaaatac 4260 tgaaaaccca aaaaaaaaaa aaaaaaaaaa aaaaaagggc gg 4302 96 1400 DNA Homo sapiens unsure 1394 unknown 96 cacgcgtccg cgctactgcg ggagcagcgt cctcccgggc cacggcgctt cccggccccg 60 gcgtccccgg accatggcgc tctccgggct cttctctagc tctcagcggc tgcgaagtct 120 gtaaacctgg tggccaagtg attgtaagtc aggagacttt ccttcggttt ctgcctttga 180 tggcaagagg tggagattgt ggcggcgatt acagaaaaca tctgggaaga caagttgctg 240 tttttatggg aatcgcaggc ttggaagaga cagaagcaat tccagaaata aattggaaat 300 tgaagattta aacaatgttg ttttaaaata ttctaacttc aaagaatgat gccagaaact 360 taaaaagggg ctgcgcagag tagcaggggc cctggagggc gcggcctgaa tcctgattgc 420 ccttctgctg agaggacaca cgcagctgaa gatgaatttg ggaaaagtag ccgcttgcta 480 ctttaactat ggaagagcag ggccacagtg agatggaaat aatcccatca gagtctcacc 540 cccacattca attactgaaa agcaatcggg aacttctggt cactcacatc cgcaatactc 600 agtgtctggt ggacaacttg ctgaagaatg actacttctc ggccgaagat gcggagattg 660 tgtgtgcctg ccccacccag cctgacaagg tccgcaaaat tctggacctg gtacagagca 720 agggcgagga ggtgtccgag ttcttcctct acttgctcca gcaactcgca gatgcctacg 780 tggacctcag gccttggctg ctggagatcg gcttctcccc ttccctgctc actcagagca 840 aagtcgtggt caacactgac ccaggtagga gtcagcccca gcaagaccgc aggcaccagt 900 gcaagcaggg ccctgggggg tttggtaatg gctgggccag ccctgagtgc cacctcagga 960 agcaggccca ggtgctattt tgattttaga aaggaacagc tgaatcctgt ctcccaagtg 1020 cagcccaggt ggctgcgatt gaactgccca cacctcgatg gtctggttta tagaggggcc 1080 tttggaagta tgggaatggc ctgtgttctg accccttgct ttcttcctat tctgacatat 1140 gtagacattt taatggttgc acaaattcaa ggttgtattt ttttttcttt aaaaaaatct 1200 ttagctggac atggtagcac acacctgtag ttccagctac tcaggaggct gaggcaagag 1260 gactgcttga gccccagagt ctaaggctgc agcgagctat gattgtgccc ctacactcca 1320 cagcctgggt tttagagtga gaccctgtct ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 aaaaaaaaaa aaangggcgg 1400 

What is claimed is:
 1. A compound 8 to 50 nucleobases in length targeted to a nucleic acid molecule encoding NOD1, wherein said compound specifically hybridizes with said nucleic acid molecule encoding NOD1 and inhibits the expression of NOD1.
 2. The compound of claim 1 which is an antisense oligonucleotide.
 3. The compound of claim 2 wherein the antisense oligonucleotide has a sequence comprising SEQ ID NO: 17, 20, 22, 23, 26, 27, 28, 29, 30, 31, 34, 35, 36, 37, 38, 41, 42, 45, 46, 50, 51, 53, 54, 55, 57, 58, 59, 61, 62, 63, 65, 66, 68, 71, 72, 79, 82, 83, 85, 86, 87, 88, 90 or
 91. 4. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified internucleoside linkage.
 5. The compound of claim 4 wherein the modified internucleoside linkage is a phosphorothioate linkage.
 6. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified sugar moiety.
 7. The compound of claim 6 wherein the modified sugar moiety is a 2′-O-methoxyethyl sugar moiety.
 8. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified nucleobase.
 9. The compound of claim 8 wherein the modified nucleobase is a 5-methylcytosine.
 10. The compound of claim 2 wherein the antisense oligonucleotide is a chimeric oligonucleotide.
 11. A compound 8 to 50 nucleobases in length which specifically hybridizes with at least an 8-nucleobase portion of an active site on a nucleic acid molecule encoding NOD1.
 12. A composition comprising the compound of claim 1 and a pharmaceutically acceptable carrier or diluent.
 13. The composition of claim 12 further comprising a colloidal dispersion system.
 14. The composition of claim 12 wherein the compound is an antisense oligonucleotide.
 15. A method of inhibiting the expression of NOD1 in cells or tissues comprising contacting said cells or tissues with the compound of claim 1 so that expression of NOD1 is inhibited.
 16. A method of treating an animal having a disease or condition associated with NOD1 comprising administering to said animal a therapeutically or prophylactically effective amount of the compound of claim 1 so that expression of NOD1 is inhibited.
 17. The method of claim 16 wherein the disease or condition arises from abberant apoptosis.
 18. The method of claim 16 wherein the disease or condition is a hyperproliferative disease.
 19. The compound of claim 1 targeted to a nucleic acid molecule encoding NOD1, wherein said compound specifically hybridizes with and differentially inhibits the expression of one of the variants of NOD1 relative to the remaining variants of NOD1.
 20. The compound of claim 19 targeted to a nucleic acid molecule encoding NOD1, wherein said compound hybridizes with and specifically inhibits the expression of a variant of NOD1, wherein said variant is selected from the group consisting of CARD4-L, CARD4-S, CARD4-X, CARD4-Y and CARD4-Z. 